DNA encoding a thermostable DNA polymerase

ABSTRACT

A nucleic acid amplifying enzyme having a short reaction time and high fidelity is provided. The enzyme of this invention is a thermostable DNA polymerase having a nucleic acid extension rate of at least 30 bases per second and a 3′-5′ exonuclease activity. Also provided are a method and kit for amplifying nucleic acid.

CROSS REFERENCE TO RELATED APPLICATIONS

This application is a division of pending application Ser. No. 09/073,259 filed May 6, 1998, which is a division of application Ser. No. 08/656,005 filed May 24, 1996, now U.S. Pat. No. 6,033,859, both of which are incorporated herein in their entirety by reference thereto.

FIELD OF THE INVENTION

The present invention relates to a method of amplifying nucleic acid wherein DNA or RNA is amplified within a short reaction time and with a high fidelity, to a method of identifying nucleic acid utilizing said amplifying method and to a DNA polymerase and a reagent kit used for those methods.

PRIOR ART

Many studies have already been made of DNA polymerases of mesophilic microorganisms such as Escherichia coli and of DNA polymerases derived from phages capable of infecting the mesophilic microorganisms. In addition, many studies have also been made of heat-stable DNA polymerases which are useful in recombinant DNA techniques by means of nucleic acid amplification such as a polymerase chain reaction (PCR). Examples of the heat-stable polymerases which are used for the PCR are DNA polymerase (Tth polymerase) mostly derived from Thermus thermophilus and DNA polymerase (Taq polymerase) derived from Thermus aquaticus. Other known examples are DNA polymerase (Pfu polymerase) derived from Pyrococcus furiosus and DNA polymerase (Vent polymerase) derived from Thermococcus litoralis.

PROBLEMS TO BE SOLVED BY THE INVENTION

However, with the Taq polymerase, fidelity and thermostability upon the synthesis of DNA are not sufficient. Although the Pfu polymerase exhibiting excellent fidelity and thermostability has been developed, said Pfu polymerase has the problems that its DNA extension rate is slow and its processivity is low, whereby it has been used only for specific PCR applications.

Recently, a PCR whereby 20 kb or more DNA is amplified (hereinafter, referred to as a long-PCR) has been developed. In said long-PCR, both Taq polymerase and Pfu polymerase are mixed whereby properties of both enzymes are utilized. However, when two enzymes having different properties are used in the same reaction system, some discrepancies might occur in their appropriate reaction conditions whereby there is a question whether the high extension rate and fidelity which are the advantages of each of those enzymes can be still maintained. Moreover, because of the difference in the thermostabilities and in the composition of the stock solutions of both enzymes, there is a question as to the stability when they are stored in the same container.

In view of the above, there has been a keen demand for novel thermostable polymerase which exhibits both of those advantages.

SUMMARY OF THE INVENTION

The present inventors have succeeded in preparing a thermostable DNA polymerase from a hyperthermophilic archaeon strain KOD1, and, when its properties were investigated, it was found that said DNA polymerase exhibits the advantages of the above-mentioned two enzymes, i.e. high extension rate and high fidelity, whereby the present invention has been achieved.

Thus, the present invention relates to a method for amplifying a target nucleic acid comprising reacting the target nucleic acid with four kinds of dNTP and a primer complementary to said target nucleic acid in a buffer solution which contains a thermostable DNA polymerase having a DNA extension rate of at least 30 bases/second and a 3′-5′ exonuclease activity such that the above-mentioned primer is annealed to the target nucleic acid and an extension product is synthesized from the primer.

The present invention further relates to a method for amplifying a target nucleic acid in a sample wherein each target nucleic acid consists of two separate complementary strands which comprises the following steps A to D, characterized in that a thermostable DNA polymerase having a DNA extension rate of at least 30 bases/second and a 3′-5′ exonuclease activity is used as a thermostable DNA polymerase;

A: modifying the target nucleic acid, if necessary, to produce single-stranded nucleic acids;

B: reacting the single-stranded nucleic acids with four kinds of dNTP and primers, wherein said primers are selected so as to be sufficiently complementary to different strands of target nucleic acid to anneal therewith, in a buffer solution which contains a thermostable DNA polymerase such that the above-mentioned primers are annealed to the single-stranded nucleic acids and extension products are synthesized from the primers,

C: separating the primer extension products from the templates on which they are synthesized to produce single-stranded nucleic acids; and

D: repeatedly conducting the above mentioned steps B and C.
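As an idealized illustration of repeating steps B and C, the copy number after n cycles may be modeled as N0 × (1 + E)^n, where E is a per-cycle efficiency between 0 and 1. This is a textbook approximation rather than part of the disclosed method; the function name, the efficiency parameter and the starting amount are assumptions introduced here for illustration.

```python
def copies_after_cycles(initial_copies: float, cycles: int, efficiency: float = 1.0) -> float:
    """Idealized copy number after repeating steps B and C `cycles` times.

    efficiency = 1.0 means every template is copied in every cycle
    (perfect doubling); real reactions fall below this.
    """
    if not 0.0 <= efficiency <= 1.0:
        raise ValueError("efficiency must be between 0 and 1")
    if cycles < 0:
        raise ValueError("cycles must be non-negative")
    return initial_copies * (1.0 + efficiency) ** cycles


if __name__ == "__main__":
    # Hypothetical starting amount of 100 template copies.
    for n in (10, 20, 30):
        print(n, copies_after_cycles(100, n))
```

The model ignores reagent depletion and plateau effects, so it gives an upper bound only.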

The present invention further relates to a method for detecting a target nucleic acid in a sample wherein each target nucleic acid consists of two separate complementary strands which comprises the following steps A to E, characterized in that a thermostable DNA polymerase having a DNA extension rate of at least 30 bases/second and a 3′-5′ exonuclease activity is used as a thermostable DNA polymerase;

A: modifying the target nucleic acid, if necessary, to produce single-stranded nucleic acids;

B: reacting the single-stranded nucleic acids with four kinds of dNTP and primers, wherein said primers are selected so as to be sufficiently complementary to different strands of target nucleic acid to anneal therewith, in a buffer solution which contains a thermostable DNA polymerase such that the above-mentioned primers are annealed to the single-stranded nucleic acids and extension products are synthesized from the primers,

C: separating the primer extension products from the templates on which they are synthesized to produce single-stranded nucleic acids;

D: repeatedly conducting the above mentioned steps B and C, and

E: detecting an amplified nucleic acid.

The present invention further relates to a reagent kit for amplifying target nucleic acid which comprises primers, wherein said primers are selected so as to be sufficiently complementary to different strands of target nucleic acid to anneal therewith, four kinds of dNTP, divalent cation, thermostable DNA polymerase having a DNA extension rate of at least 30 bases/second and a 3′-5′ exonuclease activity and buffer solution.

The present invention further relates to a reagent kit for detecting target nucleic acid which comprises primers, wherein said primers are selected so as to be sufficiently complementary to different strands of target nucleic acid to anneal therewith, four kinds of dNTP, divalent cation, thermostable DNA polymerase having a DNA extension rate of at least 30 bases/second and a 3′-5′ exonuclease activity, amplifying buffer solution, a probe capable of hybridizing with amplified nucleic acid and a detection buffer solution.

DETAILED DESCRIPTION OF THE INVENTION

The present invention relates to a thermostable DNA polymerase which is obtainable from a strain KOD1 which belongs to a hyperthermophilic archaeon strain.

The present invention relates to an isolated DNA comprising a nucleotide sequence that encodes the thermostable DNA polymerase derived from a KOD1 strain which belongs to hyperthermophilic archaeon.

The present invention further relates to a recombinant DNA expression vector that comprises the DNA sequence inserted into a vector, wherein the DNA sequence encodes the thermostable DNA polymerase derived from a KOD1 strain which belongs to hyperthermophilic archaeon.

The present invention further relates to a transformed recombinant host cell using a recombinant DNA expression vector that comprises the DNA sequence inserted into a vector, wherein the DNA sequence encodes the thermostable DNA polymerase derived from a KOD1 strain which belongs to hyperthermophilic archaeon.

The present invention relates to a method for producing a DNA polymerase obtainable from a KOD1 strain which belongs to hyperthermophilic archaeon, comprising culturing recombinant host cells which are transformed by a recombinant DNA expression vector that comprises the DNA sequence inserted into a vector, wherein the DNA sequence encodes the thermostable DNA polymerase derived from a KOD1 strain which belongs to hyperthermophilic archaeon, and recovering the produced thermostable DNA polymerase.

The present invention further relates to a method for purifying the DNA polymerase obtainable from a KOD1 strain which belongs to hyperthermophilic archaeon, comprising culturing the recombinant host cells which are transformed by a recombinant DNA expression vector that comprises the DNA sequence inserted into a vector, wherein the DNA sequence encodes the thermostable DNA polymerase derived from a KOD1 strain which belongs to hyperthermophilic archaeon, and further (a) recovering the cultured recombinant host cells, lysing them and preparing the cell extract, and (b) removing the impure proteins derived from recombinant host cells.

The nucleic acid which is to be amplified by the present invention is DNA or RNA. There is no restriction at all for the sample in which such a nucleic acid is contained.

The thermostable enzyme which is used in the present invention is a thermostable DNA polymerase having a DNA extension rate of at least 30 bases/second and having a 3′-5′ exonuclease activity. Its specific example is a DNA polymerase derived from a hyperthermophilic archaeon strain KOD1 (called a KOD polymerase) and said enzyme may be either a thermostable enzyme purified from nature or an enzyme manufactured by a gene recombination technique.

The DNA extension rate in the present invention is calculated from the relationship between the reaction time and the size of the synthesized DNA in the reaction of various kinds of DNA polymerases such as KOD, Pfu, Deep Vent, Taq, etc. (5U) in each buffer using a substrate prepared by annealing a single-stranded DNA (1.6 μg) of M13 with a primer (16 pmoles) complementary thereto. It is essential in the present invention that the DNA extension rate is at least 30 bases/second.

The DNA extension rates for each of the polymerases are 105-130 bases/second for KOD polymerase, 24.8 bases/second for Pfu polymerase, 23.3 bases/second for Deep Vent polymerase and 61.0 bases/second for Taq polymerase.
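The calculation of the extension rate from the relationship between reaction time and product size can be sketched as a least-squares slope. The time points and sizes below are hypothetical values chosen for illustration, not measurements taken from the figures, and the function name is likewise an assumption.

```python
def extension_rate(times_s, sizes_bases):
    """Least-squares slope (bases/second) of synthesized DNA size versus reaction time."""
    if len(times_s) != len(sizes_bases) or len(times_s) < 2:
        raise ValueError("need at least two paired (time, size) observations")
    n = len(times_s)
    mean_t = sum(times_s) / n
    mean_s = sum(sizes_bases) / n
    sxx = sum((t - mean_t) ** 2 for t in times_s)
    if sxx == 0:
        raise ValueError("time points must not all be equal")
    sxy = sum((t - mean_t) * (s - mean_s) for t, s in zip(times_s, sizes_bases))
    return sxy / sxx


MIN_RATE = 30.0  # bases/second, the threshold stated in the text

if __name__ == "__main__":
    # Hypothetical time course: reaction times (s) and product sizes (bases).
    times = [10, 20, 30, 40]
    sizes = [1100, 2200, 3300, 4400]
    rate = extension_rate(times, sizes)
    print(f"{rate:.1f} bases/second, meets threshold: {rate >= MIN_RATE}")
```

A slope fit is used here instead of a single size/time ratio so that a constant lag before synthesis starts does not bias the estimate; this is a design choice of the sketch.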

On the other hand, it is essential in the present invention that the thermostable DNA polymerase has a 3′-5′ exonuclease activity.

In the present invention, the 3′-5′ exonuclease activity is determined by checking the rate of release of ³H under the optimum condition for each polymerase using a substrate wherein the 3′-end of the lambda-DNA digested with HindIII labeled with [³H]TTP.

With respect to the 3′-5′ exonuclease activity of each polymerase, free ³H is found to be only 10-20% in the case of Taq polymerase and Tth polymerase after an incubation period of three hours, whereas in the case of KOD polymerase and Pfu polymerase it is 50-70%.

It has been confirmed that the KOD polymerase used in the present invention has a 3′-5′ exonuclease activity and that, in the gene which codes for KOD polymerase, there is a DNA conserved sequence showing a 3′-5′ exonuclease activity the same as in the case of Pfu polymerase.

In the present invention, the fact whether there is a 3′-5′ exonuclease activity is checked in such a manner that KOD polymerase is allowed to stand, using a DNA fragment into which the DNA of [³H]TTP-labelled-lambda-DNA digested with HindIII is incorporated as a substrate, at the reaction temperature of 75° C. in a buffer (20 mM Tris-HCl of pH 6.5, 10 mM KCl, 6 mM (NH₄)₂SO₄, 2 mM MgCl₂, 0.1% Triton X-100 and 10 μg/ml BSA) and the ratio of the free-[³H]TTP is determined.
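The ratio of free label can be expressed as a simple percentage of the total label in the substrate. The following is a minimal sketch; the function name and the radioactivity counts are hypothetical and serve only to show the arithmetic.

```python
def percent_released(free_counts: float, total_counts: float) -> float:
    """Percentage of the label in the substrate that is recovered as free label."""
    if total_counts <= 0:
        raise ValueError("total_counts must be positive")
    if not 0 <= free_counts <= total_counts:
        raise ValueError("free_counts must lie between 0 and total_counts")
    return 100.0 * free_counts / total_counts


if __name__ == "__main__":
    # Hypothetical counts: 6,000 of 10,000 counts found as free label.
    print(percent_released(6000, 10000))
```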

At the same time, Taq polymerase and Tth polymerase having no 3′-5′ exonuclease activity and Pfu polymerase having a 3′-5′ exonuclease activity were checked using a buffer for each of them by the same manner as in the control experiments. The titer of each of the used polymerases was made 2.5 units.

The substrate DNA was prepared in such a manner that, first, 0.2 mM of dATP, dGTP, dCTP and [³H]TTP were added to 10 μg of lambda-DNA digested with HindIII, the 3′-end was elongated by Klenow polymerase, then DNA fragments were recovered by extracting with phenol and precipitated with ethanol and free mononucleotides were removed by a Spin column (manufactured by Clontech).

In the case of KOD polymerase and Pfu polymerase, 50-70% of free [³H]TTP was detected after an incubation period of three hours, whereas in the case of Taq polymerase and Tth polymerase, only 10-20% of free [³H]TTP was noted.
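The two essential requirements (an extension rate of at least 30 bases/second and a 3′-5′ exonuclease activity) can be checked together as sketched below. The rates and activities are those stated in this description; for KOD polymerase the lower end of the 105-130 range is used. The class and function names are assumptions introduced for illustration.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Polymerase:
    name: str
    rate_bases_per_s: float  # DNA extension rate
    has_3to5_exo: bool       # 3'-5' exonuclease activity present


MIN_RATE = 30.0  # bases/second


def meets_both_criteria(enzyme: Polymerase, min_rate: float = MIN_RATE) -> bool:
    """True only if the enzyme is both fast enough and has proofreading activity."""
    return enzyme.rate_bases_per_s >= min_rate and enzyme.has_3to5_exo


ENZYMES = [
    Polymerase("KOD", 105.0, True),   # lower bound of the stated 105-130 range
    Polymerase("Pfu", 24.8, True),
    Polymerase("Taq", 61.0, False),
]

if __name__ == "__main__":
    for enzyme in ENZYMES:
        print(enzyme.name, meets_both_criteria(enzyme))
```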

It is preferred that said thermostable DNA polymerase contains an amino acid sequence given in SEQ ID No.1.

It is also preferred that said thermostable DNA polymerase is an enzyme having the following physical and chemical properties.

Action: It has a DNA synthetic activity and a 3′-5′ exonuclease activity.

DNA extension rate: at least 30 bases/second

Optimum pH: 6.5-7.5 (at 75° C.)

Optimum temperature: 75° C.

Molecular weight: about 88-90 kDa

Amino acid sequence: as mentioned in SEQ ID No.1

An example of the methods for manufacturing DNA polymerase derived from a hyperthermophilic archaeon strain KOD1 is that thermostable DNA polymerase gene was cloned from strain KOD1 which was isolated from a solfatara at a wharf on Kodakara Island, Kagoshima so that a recombinant expression vector was constructed, then a transformant prepared by transformation by said recombinant vector was cultured and the thermostable DNA polymerase was collected from the culture followed by purifying.

In the present invention, the DNA polymerase derived from the above-mentioned hyperthermophilic archaeon strain KOD1 has a DNA synthesizing activity and a 3′-5′ exonuclease activity and has a DNA extension rate of at least 30 bases/second. This property is used for conducting an amplification of nucleic acid.

The amplifying method of the present invention includes the following steps A to D.

A: modifying the target nucleic acid, if necessary, to produce single-stranded nucleic acids;

B: reacting the single-stranded nucleic acids with four kinds of dNTP and primers, wherein said primers are selected so as to be sufficiently complementary to different strands of target nucleic acid to anneal therewith, in a buffer solution which contains a thermostable DNA polymerase such that the above-mentioned primers are annealed to the single-stranded nucleic acids and extension products are synthesized from the primers,

C: separating the primer extension products from the templates on which they are synthesized to produce single-stranded nucleic acids; and

D: repeatedly conducting the above mentioned steps B and C.

In the step A, the target nucleic acid is denatured if necessary to give a single-stranded nucleic acid. The means therefor may be a thermal treatment, a chemical denaturation or an enzymatic treatment. Preferably, it is a thermal treatment.

In the step B, said single-stranded nucleic acid is made to react with four kinds of dNTP (dATP, dGTP, dCTP and dTTP or dUTP) and primers with regular and inverted directions having complementary base sequences to the target nucleic acid in a buffer solution containing a thermostable DNA polymerase so that said primers are annealed to the single-stranded nucleic acid to conduct a primer extension reaction.

A primer with a regular direction and that with an inverted direction having complementary base sequences to the target nucleic acid are oligonucleotides having a base sequence which is complementary to one strand of target nucleic acid and is homologous to the other strand. Accordingly, one primer may be complementary to the extension product of the other primer.

Preferred buffer solutions containing a thermostable DNA polymerase are Tris buffers containing divalent cation such as magnesium ion.

An example of the conditions for conducting an elongation reaction by annealing the primer is a method in which a cycle of 98° C./1 second-1 minute and 68° C./1 second-10 minutes is repeated for 30 times.

The step of separating an elongated primer for making a single strand in the step C may be a thermal treatment, a chemical treatment or an enzymatic treatment. Preferably, it is a thermal treatment or an enzymatic treatment using RNase.

In the step D, the above-mentioned steps B and C are repeated. To be more specific, it is preferred that heating and cooling of 98° C./20 seconds and 68° C./30 seconds are repeated at least for 30 cycles.
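The total programmed time of the preferred cycling conditions follows directly from the hold times. The sketch below sums only the hold times and ignores the time the instrument needs to ramp between temperatures; the function and variable names are assumptions introduced for illustration.

```python
def total_hold_time_s(steps, cycles):
    """Sum of programmed hold times in seconds; ramping between temperatures is ignored."""
    if cycles < 0:
        raise ValueError("cycles must be non-negative")
    return cycles * sum(seconds for _temp_c, seconds in steps)


# (temperature in deg C, hold time in seconds), as given for step D
PREFERRED_STEPS = [(98, 20), (68, 30)]

if __name__ == "__main__":
    seconds = total_hold_time_s(PREFERRED_STEPS, 30)
    print(f"{seconds} s = {seconds / 60:.0f} min of hold time")
```

For 30 cycles of 98° C./20 seconds and 68° C./30 seconds this gives 1500 seconds, i.e. 25 minutes of hold time.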

An amplifying method of the present invention is applicable to a PCR for amplifying a DNA of 20 kb or more (hereinafter, referred to as a long-PCR) as well. In this long-PCR, advantages of both high DNA extension rate of Taq polymerase and high fidelity in DNA synthesis caused by a 3′-5′ exonuclease activity of Pfu polymerase are necessary and both enzymes are used after mixing them. In this case, there is a question on a stability when both enzymes are stored in the same container because of the difference between their thermostabilities and that between the compositions of their stored solutions. However, in the DNA polymerase derived from a hyperthermophilic archaeon strain KOD1, a single enzyme exhibits both high DNA extension rate and high fidelity due to its 3′-5′ exonuclease activity whereby it is possible that a long-PCR can be conducted by its sole use.
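To illustrate why the extension rate matters for a long-PCR, the following sketch estimates the shortest conceivable extension time for a 20 kb target as target length divided by extension rate. This is an idealized lower bound that assumes uninterrupted synthesis at the stated rate; the function name is an assumption, and the rates are those given in this description (the lower end of the range is used for KOD polymerase).

```python
def min_extension_time_s(target_bases: int, rate_bases_per_s: float) -> float:
    """Idealized lower bound on the time needed to copy `target_bases` once."""
    if target_bases <= 0 or rate_bases_per_s <= 0:
        raise ValueError("target length and rate must be positive")
    return target_bases / rate_bases_per_s


RATES = {"KOD": 105.0, "Pfu": 24.8, "Taq": 61.0}  # bases/second

if __name__ == "__main__":
    for name, rate in RATES.items():
        print(f"{name}: {min_extension_time_s(20_000, rate):.0f} s per cycle")
```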

In the present invention, a target nucleic acid can be detected by applying a detection means such as a labeled probe to the amplified product produced by the above-mentioned amplification.

Labeled probe is an oligonucleotide having a base sequence which is complementary to a target nucleic acid and is bonded with a labeled substance or a labeled binding substance.

Examples of the labeled substance are enzymes such as alkaline phosphatase, peroxidase and galactosidase, fluorescent substances and radioactive substances while examples of the labeled binding substances are biotin and digoxigenin. Labeled substance may be bonded via biotin, digoxigenin or avidin.

A method of introducing those labels into a probe is that, during the synthesis of oligonucleotide, dNTP to which those labeled substances or labeled binding substances are bonded is used as one of the components of dNTP whereby a synthesis is conducted.

Examples of detecting a nucleic acid bonded with a labeled probe are conventionally known methods such as a Southern hybridization and a Northern hybridization. In those methods, the fact that a hybrid is formed when single-stranded DNA and RNA are complementary to each other is utilized whereby unknown nucleic acid fraction group is subjected to an agarose electrophoresis to separate it by size, then the nucleic acid fraction in the gel is subjected, for example, to an alkali treatment, the resulting single strand is transferred to a filter, immobilized and hybridized with a labeled probe.

As to detection of the label in the case where an alkaline phosphatase is used as a labeled substance, when a chemiluminescent substrate such as a 1,2-dioxetane compound (PPD) is made to react therewith, only the nucleic acid forming a hybrid emits light. This is recorded on an X-ray film whereby the size of the target nucleic acid and its position on electrophoresis can be confirmed.

A reagent kit for nucleic acid amplification according to the present invention contains primers of regular and inverted directions having base sequences complementary to target nucleic acid, four kinds of dNTP, divalent cation, thermostable DNA polymerase having a DNA extension rate of at least 30 bases/second and having a 3′-5′ exonuclease activity and a buffer solution.

An example of divalent cation is magnesium ion. Its concentration is preferably about 1-3 mM. Examples of the buffer solution are tris buffer (pH 6.5, 75° C.) and tricine buffer (pH 6.5, 75° C.).

A specific example of the composition is as follows.

20 mM Tris-HCl (pH 6.5, 75° C.)

10 mM KCl

6 mM (NH₄)₂SO₄

1-3 mM MgCl₂

0.1% Triton X-100

10 μg/ml BSA

20-200 μM dNTPs

0.1 pM-1 μM primer

0.1-250 ng template DNA.
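The volumes of stock solutions needed to reach the final concentrations listed above follow from the dilution relation C1·V1 = C2·V2. In the sketch below the final concentrations are taken from the composition above (2 mM MgCl₂ is chosen from within the stated 1-3 mM range), whereas the stock concentrations and the 50 μl reaction volume are hypothetical values introduced for illustration.

```python
def stock_volume_ul(final_conc: float, stock_conc: float, reaction_volume_ul: float) -> float:
    """Volume of stock (ul) needed so that final_conc is reached in reaction_volume_ul.

    final_conc and stock_conc must be in the same unit (here mM).
    """
    if stock_conc <= 0 or reaction_volume_ul <= 0 or final_conc < 0:
        raise ValueError("concentrations and volume must be positive")
    if final_conc > stock_conc:
        raise ValueError("stock is too dilute for the requested final concentration")
    return final_conc * reaction_volume_ul / stock_conc


FINAL_MM = {"Tris-HCl": 20, "KCl": 10, "(NH4)2SO4": 6, "MgCl2": 2}
STOCK_MM = {"Tris-HCl": 1000, "KCl": 1000, "(NH4)2SO4": 1000, "MgCl2": 25}  # hypothetical stocks


def plan_volumes(reaction_volume_ul: float = 50.0) -> dict:
    """Stock volume per component for one reaction (hypothetical 50 ul by default)."""
    return {
        name: stock_volume_ul(FINAL_MM[name], STOCK_MM[name], reaction_volume_ul)
        for name in FINAL_MM
    }


if __name__ == "__main__":
    for name, volume in plan_volumes().items():
        print(f"{name}: {volume:.2f} ul")
```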

A reagent kit for nucleic acid detection according to the present invention contains a nucleic acid amplifying reagent comprising primers of regular and inverted directions having base sequences complementary to target nucleic acid, four kinds of dNTP, divalent cation, thermostable DNA polymerase having a DNA extension rate of at least 30 bases/second and having a 3′-5′ exonuclease activity and a buffer solution for amplification, a target nucleic acid probe and a buffer for detection. The buffer for detection varies depending upon the label. For example, it includes a color reagent or a luminous reagent.

KOD1 which is a kind of hyperthermophilic archaeon used in the present invention is a strain isolated from a solfatara at a wharf on Kodakara Island, Kagoshima.

Mycological properties of said strain are as follows.

Shape of cells: coccus, diplococcus; having flagella.

Temperature range for the growth: 65-100° C.

Optimum temperature for the growth: 95° C.

pH range for the growth: 5-9

Optimum pH: 6

Optimum salt concentration: 2-3%

Auxotrophy: heterotrophic

Oxygen demand: aerophobic

Cell membrane lipids: either type

GC content of DNA: 38%

The hyperthermophilic archaeon strain KOD1 was a coccus having a diameter of about 1 μm and had plural polar flagella. From the mycological properties of the strain, its close relationship with Pfu DNA polymerase-productive bacterium (Pyrococcus furiosus) and with Tli (Vent) DNA polymerase-productive bacterium (Thermococcus litoralis) was suggested.

Cloning of the thermostable DNA polymerase gene of the present invention is carried out as follows.

Thus, the cloning method is that a primer is designed and synthesized depending upon an amino acid sequence in a conserved region of Pfu DNA polymerase (Nucleic Acids Research, 1993, vol.21, No.2, 259-265).

First, a PCR is conducted using the above-prepared primers (e.g., SEQ ID Nos.7 and 8) taking chromosomal DNA of the hyperthermophilic archaeon strain KOD1 as a template to amplify the DNA fragment. The DNA sequence (e.g., SEQ ID No.9) of the amplified fragment is determined and, after confirming that the originally set amino acid sequence is coded for, a Southern hybridization is conducted to the cleaved product of the chromosomal DNA with a restriction enzyme using said fragment as a probe. It is preferred that the approximate size of the fragment containing the aimed DNA polymerase gene is limited to about 4-7 Kbp.

Then DNA fragment of about 4-7 Kbp is recovered from the gel, a DNA library is prepared by Escherichia coli using said fragment and a colony hybridization is carried out using the above-mentioned PCR-amplified DNA fragment (e.g., SEQ ID No.9) to collect a clone strain.

The DNA polymerase gene of the strain KOD1 cloned in the present invention is composed of 5010 bases (estimated numbers of amino acids: 1670) (SEQ ID No.5).

Upon comparison with other DNA polymerases, the gene of the present invention contains the conserved regions (Regions 1-5) of α-type DNA polymerase, which is the eukaryotic type. In addition, EXO 1, 2 and 3, which are 3′→5′ exonuclease motifs, are present at the N-terminal side of said gene. In the conserved regions (Regions 1, 2) of the thermostable DNA polymerase gene derived from the hyperthermophilic archaeon strain KOD1, each of the intervening sequences is present and they are connected in a form where the open reading frame (ORF) is conserved.

When the thermostable DNA polymerase gene of the hyperthermophilic archaeon strain KOD1 is compared with Pfu DNA polymerase gene derived from Pyrococcus furiosus (Japanese Laid-Open Patent Publication Hei-05/328969) and with Tli (Vent) DNA polymerase gene derived from Thermococcus litoralis (Japanese Laid-Open Patent Publication Hei-06/7160) which are known enzymes, intervening sequence is present in the gene of the strain KOD1 of the present invention while there is no intervening sequence in the gene of the above-mentioned Pfu DNA polymerase and, in the Tli DNA polymerase gene, there are two kinds of intervening sequences but they are present within Regions 2 and 3 which are conserved regions and that greatly differs from the location where the intervening sequence in the thermostable DNA polymerase gene of KOD1 strain of the present invention exists (Refer to FIG. 7).

The gene of the present invention is a DNA which codes for the DNA polymerase derived from the hyperthermophilic archaeon strain KOD1. An example of said DNA contains a base sequence which codes for the amino acid sequence mentioned in SEQ ID No. 1 or 5. Further, such a DNA contains a base sequence mentioned in SEQ ID No. 5 or 6 or a part thereof.

In order to express the thermostable DNA polymerase derived from the hyperthermophilic archaeon strain KOD1 of the present invention in Escherichia coli, the intervening sequences of 1374-2453 bp and 2708-4316 bp in the base sequence shown by SEQ ID No.5 are removed by means of a PCR gene fusion to construct a DNA polymerase gene of a complete form. To be specific, a PCR is conducted on a cloned gene containing the intervening sequence by a combination of three pairs of primers to amplify the three fragments which are divided by the intervening sequence. In designing the primers used here, a part of the fragment which is to be bonded to its terminal is contained in its 5′-end. Then a PCR is conducted using the fragments to be bonded utilizing the duplicated sequence of the terminal whereby each of the fragments is bonded. Further PCR is conducted by the same manner using the resulting two kinds of fragments to give a DNA polymerase gene in a complete form, derived from the strain KOD1 and containing no intervening sequence.
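The effect of removing the intervening sequences can be sketched in terms of coordinates: the regions that remain are those lying outside the removed ranges. The following is an illustrative sketch that operates on 1-based inclusive coordinates as used in this description; the function names are assumptions, and the short sequence in the usage example is a toy string, not part of SEQ ID No.5.

```python
def retained_segments(total_length, removed):
    """Return the 1-based inclusive intervals left after excising `removed` intervals."""
    segments = []
    position = 1
    for start, end in sorted(removed):
        if not (position <= start <= end <= total_length):
            raise ValueError("removed intervals must be ordered, non-overlapping and in range")
        if start > position:
            segments.append((position, start - 1))
        position = end + 1
    if position <= total_length:
        segments.append((position, total_length))
    return segments


def splice(sequence, removed):
    """Join the retained segments of `sequence` into one continuous string."""
    return "".join(sequence[s - 1:e] for s, e in retained_segments(len(sequence), removed))


if __name__ == "__main__":
    # Coordinates stated in the text for the 5010-base gene.
    print(retained_segments(5010, [(1374, 2453), (2708, 4316)]))
    # Toy sequence: remove positions 4-6.
    print(splice("AAACCCGGGTTT", [(4, 6)]))
```

The three retained intervals correspond to the three fragments that are amplified separately and then joined by the PCR gene fusion.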

Any vector may be used in the present invention so far as it makes cloning and expression of the thermostable DNA polymerase derived from KOD1 possible and its example is phage and plasmid. An example of the plasmid is a plasmid vector wherein an expression induced by T7 promoter is possible such as pET-8c. Other examples of the plasmid are pUC19, pBR322, pBluescript, pSP73, pGW7, pET3A and pET11C and so on. Examples of the phage are lambda gt11, lambda DASH and lambda ZapII and so on.

Examples of the host cell used in the present invention are Escherichia coli and yeasts. Examples of Escherichia coli are JM109, 101, XL1, PR1 and BL21(DE3)pLysS and so on.

In the present invention, the gene coding for the thermostable DNA polymerase derived from the above-mentioned KOD1 is inserted into the above-mentioned vector to give a recombinant vector and the host cell is subjected to a transformation using said recombinant vector.

In the production method of the present invention, the above-mentioned recombinant host cell is cultured whereby the thermostable DNA polymerase gene derived from the strain KOD1 is induced and expressed. The culture medium used for the culture of the recombinant host cell and the condition therefor follow the conventional methods.

In a specific example, Escherichia coli which is transformed by pET-8c plasmid containing a DNA polymerase gene in a complete form containing no intervening sequence derived from the strain KOD1 is cultured, for example, in a TB medium whereby an induction treatment is conducted. It is preferred that the induction treatment of T7 promoter is carried out by addition of isopropylthio-β-D-galactoside.

The purifying method of the present invention includes, after culturing the recombinant host cells, a step wherein (a) recombinant host cells are collected, lysed and the cell extract is prepared and a step wherein (b) impure protein derived from the host cells is removed.

The thermostable DNA polymerase which is produced from the recombinant host cells is separated and recovered from the culture liquid by means of centrifugation or the like after culturing the host bacterial cells in a medium followed by inducing. After said bacterial cells are resuspended in a buffer, they are lysed by means of ultrasonic treatment, Dyno mill, French press, etc. Then a thermal treatment is conducted and the heat stable DNA polymerase is recovered from the supernatant fluid. In disintegrating the bacterial cells, ultrasonic treatment, Dyno mill and French press method are preferred.

A thermal treatment is preferred as one of the steps for removing the impure protein derived from the host cells. The condition for the thermal treatment is at 70° C. or higher or, preferably, at 90° C. or higher. Other means for removing the impure protein are various chromatographic techniques.

Molecular weight of the thermostable DNA polymerase derived from the hyperthermophilic archaeon strain KOD1 obtained as such is about 90 kDa (cf. FIG. 5).

When a polymerase chain reaction is conducted using said thermostable DNA polymerase, a sufficient amplification of the aimed DNA fragments is confirmed (cf. FIG. 6).

Now the present invention will be illustrated by referring partly to the drawings wherein:

FIG. 1 is a photographic picture of electrophoresis as a substitute for a drawing and shows the result of the measurement of the DNA extension rate of the KOD polymerase;

FIG. 2 is a photographic picture of electrophoresis as a substitute for a drawing and shows the comparison of the DNA extension rate of various thermostable DNA polymerases in which FIG. 2a shows the cases of KOD polymerase and Pfu polymerase while FIG. 2b shows the cases of Deep Vent polymerase and Taq polymerase;

FIG. 3 is a photographic picture of electrophoresis as a substitute for a drawing and shows the comparison of the PCR due to the difference in the reaction time of various thermostable DNA polymerase;

FIG. 4 shows the constructive charts of the recombinant expression vector;

FIG. 5 is a photographic picture of electrophoresis as a substitute for a drawing and shows the result of the measurement of molecular weight of the thermostable DNA polymerase derived from KOD1;

FIG. 6 is a photographic picture of electrophoresis as a substitute for a drawing and shows the result of the PCR by the thermostable DNA polymerase derived from KOD1; and

FIG. 7 is a drawing which shows a comparison of the DNA polymerase gene derived from the hyperthermophilic archaeon strain KOD1 with the thermostable DNA polymerase gene derived from Pyrococcus furiosus and that derived from Thermococcus litoralis which are thought to be similar bacteria.

EXAMPLE 1

Cloning of DNA Polymerase Gene Derived from Hyperthermophilic Archaeon Strain KOD1

The hyperthermophilic archaeon strain KOD1 isolated in Kodakara Island, Kagoshima was cultured at 95° C. and then the bacterial cells were recovered. Chromosomal DNA of the hyperthermophilic archaeon strain KOD1 was prepared by a conventional method from the resulting bacterial cells.

Two kinds of primers (5′-GGATTAGTATAGTGCCAATGGAAGGCGAC-3′ [SEQ ID No.7] and 5′-GAGGGCGAAGTTTATTCCGAGCTT-3′ [SEQ ID No.8]) were synthesized based upon the amino acid sequence at the conserved region of the DNA polymerase (Pfu polymerase) derived from Pyrococcus furiosus. A PCR was carried out using those two primers where the prepared chromosomal DNA was used as a template.
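Basic properties of the two primers given above (length, GC fraction and reverse complement) can be computed as sketched below. The helper names are assumptions introduced for illustration; the sequences are those of SEQ ID Nos.7 and 8 as written in this example.

```python
COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq: str) -> str:
    """Sequence of the complementary strand, written 5' to 3'."""
    return seq.translate(COMPLEMENT)[::-1]


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    if not seq:
        raise ValueError("sequence must not be empty")
    return (seq.count("G") + seq.count("C")) / len(seq)


PRIMERS = {
    "SEQ ID No.7": "GGATTAGTATAGTGCCAATGGAAGGCGAC",
    "SEQ ID No.8": "GAGGGCGAAGTTTATTCCGAGCTT",
}

if __name__ == "__main__":
    for name, seq in PRIMERS.items():
        print(name, len(seq), f"GC={gc_fraction(seq):.2f}", reverse_complement(seq))
```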

After the base sequence (SEQ ID No.9) of the PCR-amplified DNA fragment was determined and the amino acid sequence (SEQ ID No.10) was determined, a Southern hybridization was conducted using said amplified DNA fragment to the product of the strain KOD1 chromosomal DNA treated with a restriction enzyme whereby the size of the fragment coding for the DNA polymerase was calculated (about 4-7 Kbp). Further, the DNA fragment of this size was recovered from agarose gel, inserted into a plasmid pBS (manufactured by Stratagene) and Escherichia coli (E. coli JM 109) was transformed by this mixture to prepare a library.

A colony hybridization was conducted using the probe (SEQ ID No.9) used for the Southern hybridization to obtain a clone strain (E. coli JM109/pBSKOD1) which is thought to contain the DNA polymerase gene derived from strain KOD1.

EXAMPLE 2

Determination of Base Sequence of the Clone Fragment

A plasmid pBSKOD1 was recovered from the clone strain E. coli JM109/pBSKOD1 obtained in Example 1 and its base sequence (SEQ ID No.5) was determined by a conventional method. Further, the amino acid sequence was presumed from the determined base sequence. The DNA polymerase gene derived from KOD1 strain comprised 5010 bases wherein 1670 amino acids were coded.
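The relation between the two figures above is simply three bases per codon. A minimal sketch of that check follows; the function name `codons_in` is illustrative and not taken from the source.

```python
# Sanity check on the reported gene length: three bases encode one amino acid.
CODON_LENGTH = 3


def codons_in(n_bases: int) -> int:
    """Return the number of whole codons in a reading frame of n_bases bases."""
    if n_bases % CODON_LENGTH != 0:
        raise ValueError(f"{n_bases} bases is not a whole number of codons")
    return n_bases // CODON_LENGTH


gene_length_bases = 5010  # length reported in Example 2
print(codons_in(gene_length_bases))  # 1670
```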

EXAMPLE 3

Construction of Recombinant Expression Vector

In order to prepare a complete polymerase gene, the intervening sequence parts at two places (1374-2453 bp and 2708-4316 bp) were removed by a PCR fusion method. In the PCR fusion method, three pairs of primers (SEQ ID Nos.11-16) were combined, the plasmid recovered from the clone strain was used as a template, and a PCR was conducted for each pair to amplify three fragments from which the intervening sequences were removed. The primers used for the PCR were designed in such a manner that the side which binds to another fragment has the same sequence as its binding partner. In addition, the primers were designed so that different restriction enzyme sites (EcoRV at the N-terminal end and BamHI at the C-terminal end) were created at the two ends.

After that, among the PCR-amplified fragments, that which is located at the central part of the structure and that which is located at the N-terminal side were mixed and a PCR was conducted using each of the fragments as a primer. At the same time, the fragment located at the central part of the structure and that located at the C-terminal side were mixed and a PCR was conducted using each of the fragments as a primer. The two kinds of fragments thus obtained were subjected to a PCR once again to give a gene fragment in a complete form having no intervening sequence, having EcoRV and BamHI sites at the N- and C-terminals, respectively, and coding for the DNA polymerase derived from strain KOD1.
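The net effect of the fusion PCR on the sequence, namely joining the flanking coding segments once the intervening sequences are dropped, can be illustrated in silico. The following is only a sketch of that outcome and not of the wet-lab procedure; the function `splice_out`, the toy sequence and its coordinates are hypothetical and were chosen purely for illustration.

```python
def splice_out(seq: str, intervals: list[tuple[int, int]]) -> str:
    """Remove 1-based, inclusive intervals from seq and join what remains."""
    pieces = []
    cursor = 0  # 0-based index of the first base not yet handled
    for start, end in sorted(intervals):
        if start < 1 or end > len(seq) or start > end or start - 1 < cursor:
            raise ValueError(f"invalid or overlapping interval: {start}-{end}")
        pieces.append(seq[cursor:start - 1])  # keep the segment before the interval
        cursor = end                          # skip the interval itself
    pieces.append(seq[cursor:])               # keep the tail after the last interval
    return "".join(pieces)


# Hypothetical toy sequence: upper-case letters stand for coding segments,
# lower-case letters for two intervening sequences at positions 5-8 and 15-17.
toy = "AAAAnnnnCCCCCCxxxGGGG"
print(splice_out(toy, [(5, 8), (15, 17)]))  # AAAACCCCCCGGGG
```

The three retained pieces correspond to the N-terminal, central and C-terminal fragments that the fusion PCR joins.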

Further, said gene was subcloned into an expression vector which can be induced by the T7 promoter, using the NcoI/BamHI site of pET-8c and the previously created restriction enzyme sites, to give a recombinant expression vector (pET-pol).

EXAMPLE 4

Expression and Purification of DNA Polymerase Derived from KOD1

Escherichia coli (BL21(DE3)) was transformed using the recombinant expression vector (pET-pol) obtained in Example 3, the resulting transformant was cultured in a TB medium (mentioned in Molecular Cloning, p.A.2, 1989) and, one hour before collecting the bacterial cells, the T7 promoter was induced by addition of isopropylthio-β-D-galactopyranoside. Bacterial cells were recovered from the culture liquid by means of centrifugation. They were resuspended in a buffer and disrupted by an ultrasonic treatment to give a cell extract. In order to remove the impure protein derived from the host cells, the disrupted cell solution was treated at 94° C. for 20 minutes, whereby the impure protein derived from the host cells was removed by centrifugation to give a thermostable DNA polymerase derived from strain KOD1.

The Escherichia coli BL21 (DE3) pET-pol was deposited on Apr. 22, 1996 under the Budapest Treaty at the National Institute of Bioscience and Human-Technology, Agency of Industrial Science and Technology (1-3, Higashi 1 chome, Tsukuba-shi, Ibaraki-ken 305, JAPAN) under the accession number FERM BP-5513.

EXAMPLE 5

Purification of Thermostable DNA Polymerase Derived from KOD1

The molecular weight of the thermostable DNA polymerase derived from KOD1 obtained in Example 4 was determined by means of an SDS-PAGE method and was found to be about 86-92 kDa (FIG. 5). Further, a PCR was conducted using the thermostable DNA polymerase derived from KOD1 obtained in Example 4 together with a known template and primers, whereupon the target DNA fragment was confirmed (FIG. 6) in the same manner as in the case where the thermostable DNA polymerase derived from Thermococcus litoralis was used, and a high thermostable DNA polymerase activity was confirmed.
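As a rough plausibility check on the SDS-PAGE figure, the mass can be estimated from the length of the mature protein. The sketch below assumes the 774-residue length given for SEQ ID No.1 in the sequence listing and an average residue mass of about 110 Da, which is a common rule of thumb rather than a value from the source; the function name is illustrative.

```python
AVERAGE_RESIDUE_MASS_DA = 110.0  # assumption: rule-of-thumb average, not from the source


def estimate_mass_kda(n_residues: int) -> float:
    """Estimate protein mass in kDa from residue count using an average residue mass."""
    return n_residues * AVERAGE_RESIDUE_MASS_DA / 1000.0


mature_length = 774  # length of SEQ ID No.1 as given in the sequence listing
print(f"{estimate_mass_kda(mature_length):.1f} kDa")  # 85.1 kDa
```

The estimate of roughly 85 kDa lies close to the lower end of the 86-92 kDa range read from the gel, which is about the agreement one would expect from so coarse an approximation.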

COMPARATIVE EXAMPLE 1

Comparison with the Thermostable DNA Polymerase Gene Derived from Pyrococcus furiosus or from Thermococcus litoralis which are Thought to be Similar to the Hyperthermophilic Archaeon Strain KOD1 of the Present Invention

Amino acid sequences were estimated from the DNA sequences of the DNA polymerase gene derived from the hyperthermophilic archaeon strain KOD1 of the present invention (SEQ ID No.6), the thermostable DNA polymerase gene derived from Pyrococcus furiosus (Japanese Laid-Open Patent Publication Hei-5/328969) and the thermostable DNA polymerase gene derived from Thermococcus litoralis (Japanese Laid-Open Patent Publication Hei-6/7160) and were compared and investigated.

In the DNA polymerase derived from KOD1 of the present invention, there were Regions 1-5, which are the conserved regions of αDNA polymerase of a eukaryotic type. Further, there were EXO1, 2 and 3, which are 3′→5′ exonuclease motifs, at the N-terminal side. However, in each of Region 1 and Region 2, which are αDNA polymerase conserved regions, there were intervening sequences IVS-A and IVS-B (refer to FIG. 7).

On the other hand, in Pfu polymerase which is a thermostable DNA polymerase derived from Pyrococcus furiosus, there was no intervening sequence. In the case of Vent polymerase which is a thermostable DNA polymerase derived from Thermococcus litoralis, there were the intervening sequences (IVS1 and IVS2) in the αDNA polymerase conserved regions (Region 2 and Region 3) (refer to FIG. 7).

EXAMPLE 6

Measurement of DNA Extension Rate of the DNA Polymerase Derived from Hyperthermophilic archaeon strain KOD1

DNA prepared by annealing the M13mp18DNA with M13P7 primer having a base sequence as mentioned in SEQ ID No.2 was used as a substrate and the rate of synthesizing the DNA in a reaction buffer solution [20 mM Tris-HCl (pH 7.5 at 75° C.), 10 mM KCl, 6 mM (NH₄)₂SO₄, 2 mM MgCl₂, 0.1% Triton X-100 and 10 μg/ml nuclease-free BSA] containing the DNA polymerase derived from the hyperthermophilic archaeon strain KOD1 manufactured in Examples 1-5 was investigated for the reaction time of 20, 40, 60, 80 and 100 seconds (FIG. 1) or 40, 60, 80 and 100 seconds (FIG. 2). The results are given in FIG. 1 and in FIG. 2.

A part of the DNA sample during the elongation reaction was taken out for each reaction time and was added to a reaction stopping solution (60 mM EDTA, 60 μM NaOH, 0.1% BPB and 30% glycerol) in the same amount.

The DNA samples obtained in the above process were separated and analyzed by means of an alkaline agarose electrophoresis and the size of the synthesized DNA was checked.

1, 2, 3, 4 and 5 in FIG. 1 show the results of the reactions for 0.3 minute (20 seconds), 0.7 minute (40 seconds), 1 minute (60 seconds), 1.3 minutes (80 seconds) and 1.7 minutes (100 seconds), respectively. It is apparent from FIG. 1 that the DNA extension rate of the DNA polymerase derived from the hyperthermophilic archaeon strain KOD1 was 105 bases/second. 1, 2, 3 and 4 in FIG. 2 show the results of the reaction for 0.7 minute (40 seconds), 1 minute (60 seconds), 1.3 minutes (80 seconds) and 1.7 minutes (100 seconds), respectively. It is apparent from FIG. 2 that the DNA extension rate of the DNA polymerase derived from the hyperthermophilic archaeon strain KOD1 was 138 bases/second.

On the other hand, the DNA synthesizing rate of each of Pfu polymerase (Stratagene), Deep Vent polymerase (New England Biolabs) and Taq polymerase (Takara Shuzo) was measured in the same manner in each of the buffers therefor (FIG. 2a and FIG. 2b). The DNA extension rates of those DNA polymerases were 24.8 bases/second for Pfu polymerase, 23.2 bases/second for Deep Vent polymerase and 61.0 bases/second for Taq polymerase.

From the above results, it was suggested that the DNA extension rate of the DNA polymerase derived from the hyperthermophilic archaeon strain KOD1 was about six times those of Pfu polymerase and Deep Vent polymerase and about twice that of Taq polymerase.
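The fold differences can be recomputed directly from the rates reported in this example. The sketch below uses only those figures (105 and 138 bases/second for KOD1; 24.8, 23.2 and 61.0 bases/second for the comparison enzymes); the names `fold_range`, `KOD1_RATES` and `REFERENCE_RATES` are illustrative.

```python
# Extension rates in bases/second, as reported in Example 6.
KOD1_RATES = (105.0, 138.0)  # from FIG. 1 and FIG. 2, respectively
REFERENCE_RATES = {"Pfu": 24.8, "Deep Vent": 23.2, "Taq": 61.0}


def fold_range(kod_rates: tuple[float, ...], reference_rate: float) -> tuple[float, float]:
    """Return (lowest, highest) ratio of the KOD1 rates to a reference rate."""
    return min(kod_rates) / reference_rate, max(kod_rates) / reference_rate


for name, rate in REFERENCE_RATES.items():
    low, high = fold_range(KOD1_RATES, rate)
    print(f"KOD1 vs {name}: {low:.1f}- to {high:.1f}-fold")
```

Depending on which of the two KOD1 measurements is used, the ratio to Pfu and Deep Vent polymerases falls between roughly 4 and 6, and the ratio to Taq polymerase is close to 2.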

EXAMPLE 7

Measurement of Fidelity of the DNA Polymerase Derived from the Hyperthermophilic Archaeon Strain KOD1 in the Reaction for the Synthesis of DNA

The rate at which errors arise in the DNA synthesis was measured by the method of Kunkel (Kunkel, 1985, Journal of Biological Chemistry, 260, 5787-5796). In this method, a DNA synthesis reaction was conducted with the DNA polymerase derived from the hyperthermophilic archaeon strain KOD1 manufactured in Examples 1-5, using as a substrate an M13mp18 DNA having a gap at the lacZ part, which contains a part of the gene coding for β-galactosidase. The resulting M13mp18 DNA, in which the lacZ part was double-stranded, was transfected to E. coli JM109 in an NZY medium containing 5-bromo-4-chloro-3-indolyl-β-D-galactoside and isopropylthio-β-D-galactoside.

When a β-galactosidase whose function is lost or lowered is expressed due to a reading error or a frame shift during the synthetic reaction of DNA, 5-bromo-4-chloro-3-indolyl-β-D-galactoside cannot be utilized, whereupon the plaque becomes colorless or light blue. On the other hand, when there is no error in the synthesized DNA and a complete β-galactosidase is expressed, the plaque becomes blue. The rate of induction of error in the DNA synthesis was measured as the ratio of the sum of colorless and light blue plaques to the total plaques.

The rate of induction of error in the DNA synthesis was also measured for Pfu polymerase (Stratagene), Taq polymerase (Takara Shuzo) and delta Tth polymerase (Toyobo), which were made to react in the same manner.

Further, the rate of induction of error in the DNA synthesis was also measured for a mixture of Taq polymerase and Pfu polymerase. The results are given in Table 1.

TABLE 1

Measurement of Fidelity in the Reaction of DNA Synthesis of DNA Polymerase Derived from Hyperthermophilic Archaeon Strain KOD1

Enzyme           Light Blue   White   Mutant   Total   Mutant Frequency (10⁻⁴)
KOD1 pol.        12           11      23       6619    37.7
Pfu              15           15      30       7691    39.0
Taq              30           24      54       4141    130
ΔTth             70           45      115      7375    156
Taq/Pfu (20:1)   10           20      30       4238    63.7
Taq/Pfu (50:1)   10           13      23       4489    53.5

Table 1 suggests that the fidelity of the DNA polymerase derived from hyperthermophilic archaeon strain KOD1 in the DNA synthesis reaction is superior to that of Taq polymerase and about the same as that of Pfu polymerase. In addition, a mixture of Taq polymerase and Pfu polymerase exhibits an intermediate fidelity, superior to that of Taq polymerase and inferior to that of Pfu polymerase.
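The mutant frequency described in this example is the ratio of mutant (light blue plus white) plaques to total plaques. The sketch below applies that plain ratio to the plaque counts of Table 1 and expresses it in units of 10⁻⁴. It is an illustration under the assumption that no further correction is applied; any adjustment used for the tabulated frequency column is not described in the text, so the computed values need not reproduce every tabulated entry. The names used are illustrative.

```python
# Plaque counts from Table 1: (light blue, white, total).
PLAQUE_COUNTS = {
    "KOD1 pol.": (12, 11, 6619),
    "Pfu": (15, 15, 7691),
    "Taq": (30, 24, 4141),
    "delta Tth": (70, 45, 7375),
    "Taq/Pfu (20:1)": (10, 20, 4238),
    "Taq/Pfu (50:1)": (10, 13, 4489),
}


def mutant_frequency_e4(light_blue: int, white: int, total: int) -> float:
    """Return mutant plaques / total plaques, expressed in units of 1e-4."""
    if total <= 0:
        raise ValueError("total plaque count must be positive")
    return (light_blue + white) / total * 1e4


for enzyme, (light_blue, white, total) in PLAQUE_COUNTS.items():
    print(f"{enzyme}: {mutant_frequency_e4(light_blue, white, total):.1f}")
```

Whatever the exact values, the ordering of the enzymes is the point of the comparison: KOD1 and Pfu polymerases give the lowest frequencies, Taq and delta Tth polymerases the highest, and the Taq/Pfu mixtures fall in between.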

EXAMPLE 8

Comparison in PCR of Various Thermostable DNA Polymerases by the Difference in the Reaction Time

Lambda-DNA (3 μg) was used as a target nucleic acid; oligonucleotides having the sequences mentioned in SEQ ID Nos. 3 and 4 were used as primers; and a buffer containing 20 mM Tris-HCl (pH 7.5 at 75° C.), 10 mM KCl, 6 mM (NH₄)₂SO₄, 2 mM MgCl₂, 0.1% Triton X-100, 10 μg/ml BSA and 200 μM dNTPs was used as a buffer. DNA polymerase derived from hyperthermophilic archaeon strain KOD1 (KOD polymerase), Taq polymerase, which is widely used for PCR, and Pfu polymerase, which exhibits 3′-5′ exonuclease activity, were used as the thermostable DNA polymerases. The titer of each polymerase used was 2 units.

A PCR amplification reaction was conducted using a DNA Thermal Cycler (Perkin-Elmer) in a schedule wherein a cycle comprising 94° C./20 seconds and 68° C./x seconds (x: reaction time) was repeated 30 times. In the case of the DNA polymerase derived from the hyperthermophilic archaeon strain KOD1 (KOD polymerase), amplification of the target DNA was confirmed by conducting 30 cycles of 94° C./20 seconds-68° C./1 second while, in the case of Taq polymerase, amplification of DNA was first confirmed by conducting 30 cycles of 94° C./20 seconds-68° C./10 seconds. In the case of Pfu polymerase, amplification of DNA was at least confirmed by conducting 30 cycles of 94° C./20 seconds-68° C./1 minute. The results are given in FIG. 3.
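The practical consequence of the shorter extension step can be seen by adding up the programmed hold times of the 30-cycle schedule. The sketch below counts only the 94° C. and 68° C. holds stated above and ignores temperature ramping, which depends on the instrument and is not given in the text; the function and dictionary names are illustrative.

```python
def total_hold_seconds(extension_s: float, denature_s: float = 20.0, cycles: int = 30) -> float:
    """Sum of programmed hold times for a two-step PCR schedule (ramp times excluded)."""
    return cycles * (denature_s + extension_s)


# Shortest 68 deg C step at which amplification was reported for each enzyme.
EXTENSION_STEP_S = {"KOD": 1.0, "Taq": 10.0, "Pfu": 60.0}

for enzyme, extension_s in EXTENSION_STEP_S.items():
    total = total_hold_seconds(extension_s)
    print(f"{enzyme}: {total:.0f} s ({total / 60:.1f} min)")
```

On this count the schedule takes 630 seconds of hold time with KOD polymerase, 900 seconds with Taq polymerase and 2400 seconds with Pfu polymerase.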

In the present invention, it is possible to amplify DNA with a high fidelity within a short reaction time when a DNA polymerase derived from hyperthermophilic archaeon strain KOD1, which is a thermostable DNA polymerase having a DNA extension rate of at least 30 bases/second and a 3′-5′ exonuclease activity, is used. When this method is made into the form of a kit, simplicity and convenience can be improved. In addition, when only one kind of thermostable DNA polymerase having both a high extension rate (at least 30 bases/second), which has not been available until now, and a 3′-5′ exonuclease activity is used, it is possible to shorten the time for the primer extension reaction and to amplify a relatively large product with a high fidelity.

16 1 774 PRT Hyperthermophilic archaeon 1 Met Ile Leu Asp Thr Asp Tyr Ile Thr Glu Asp Gly Lys Pro Val Ile 5 10 15 Arg Ile Phe Lys Lys Glu Asn Gly Glu Phe Lys Ile Glu Tyr Asp Arg 20 25 30 Thr Phe Glu Pro Tyr Phe Tyr Ala Leu Leu Lys Asp Asp Ser Ala Ile 35 40 45 Glu Glu Val Lys Lys Ile Thr Ala Glu Arg His Gly Thr Val Val Thr 50 55 60 Val Lys Arg Val Glu Lys Val Gln Lys Lys Phe Leu Gly Arg Pro Val 65 70 75 80 Glu Val Trp Lys Leu Tyr Phe Thr His Pro Gln Asp Val Pro Ala Ile 85 90 95 Arg Asp Lys Ile Arg Glu His Gly Ala Val Ile Asp Ile Tyr Glu Tyr 100 105 110 Asp Ile Pro Phe Ala Lys Arg Tyr Leu Ile Asp Lys Gly Leu Val Pro 115 120 125 Met Glu Gly Asp Glu Glu Leu Lys Met Leu Ala Phe Asp Ile Gln Thr 130 135 140 Leu Tyr His Glu Gly Glu Glu Phe Ala Glu Gly Pro Ile Leu Met Ile 145 150 155 160 Ser Tyr Ala Asp Glu Glu Gly Ala Arg Val Ile Thr Trp Lys Asn Val 165 170 175 Asp Leu Pro Tyr Val Asp Val Val Ser Thr Glu Arg Glu Met Ile Lys 180 185 190 Arg Phe Leu Arg Val Val Lys Glu Lys Asp Pro Asp Val Leu Ile Thr 195 200 205 Tyr Asn Gly Asp Asn Phe Asp Phe Ala Tyr Leu Lys Lys Arg Cys Glu 210 215 220 Lys Leu Gly Ile Asn Phe Ala Leu Gly Arg Asp Gly Ser Glu Pro Lys 225 230 235 240 Ile Gln Arg Met Gly Asp Arg Phe Ala Val Glu Val Lys Gly Arg Ile 245 250 255 His Phe Asp Leu Tyr Pro Val Ile Arg Arg Thr Ile Asn Leu Pro Thr 260 265 270 Tyr Thr Leu Glu Ala Val Tyr Glu Ala Val Phe Gly Gln Pro Lys Glu 275 280 285 Lys Val Tyr Ala Glu Glu Ile Thr Pro Ala Trp Glu Thr Gly Glu Asn 290 295 300 Leu Glu Arg Val Ala Arg Tyr Ser Met Glu Asp Ala Lys Val Thr Tyr 305 310 315 320 Glu Leu Gly Lys Glu Phe Leu Pro Met Glu Ala Gln Leu Ser Arg Leu 325 330 335 Ile Gly Gln Ser Leu Trp Asp Val Ser Arg Ser Ser Thr Gly Asn Leu 340 345 350 Val Glu Trp Phe Leu Leu Arg Lys Ala Tyr Glu Arg Asn Glu Leu Ala 355 360 365 Pro Asn Lys Pro Asp Glu Lys Glu Leu Ala Arg Arg Arg Gln Ser Tyr 370 375 380 Glu Gly Gly Tyr Val Lys Glu Pro Glu Arg Gly Leu Trp Glu Asn Ile 385 390 395 400 Val Tyr Leu Asp Phe Arg Ser Leu Tyr Pro Ser Ile Ile Ile 
Thr His 405 410 415 Asn Val Ser Pro Asp Thr Leu Asn Arg Glu Gly Cys Lys Glu Tyr Asp 420 425 430 Val Ala Pro Gln Val Gly His Arg Phe Cys Lys Asp Phe Pro Gly Phe 435 440 445 Ile Pro Ser Leu Leu Gly Asp Leu Leu Glu Glu Arg Gln Lys Ile Lys 450 455 460 Lys Lys Met Lys Ala Thr Ile Asp Pro Ile Glu Arg Lys Leu Leu Asp 465 470 475 480 Tyr Arg Gln Arg Ala Ile Lys Ile Leu Ala Asn Ser Tyr Tyr Gly Tyr 485 490 495 Tyr Gly Tyr Ala Arg Ala Arg Trp Tyr Cys Lys Glu Cys Ala Glu Ser 500 505 510 Val Thr Ala Trp Gly Arg Glu Tyr Ile Thr Met Thr Ile Lys Glu Ile 515 520 525 Glu Glu Lys Tyr Gly Phe Lys Val Ile Tyr Ser Asp Thr Asp Gly Phe 530 535 540 Phe Ala Thr Ile Pro Gly Ala Asp Ala Glu Thr Val Lys Lys Lys Ala 545 550 555 560 Met Glu Phe Leu Asn Tyr Ile Asn Ala Lys Leu Pro Gly Ala Leu Glu 565 570 575 Leu Glu Tyr Glu Gly Phe Tyr Lys Arg Gly Phe Phe Val Thr Lys Lys 580 585 590 Lys Tyr Ala Val Ile Asp Glu Glu Gly Lys Ile Thr Thr Arg Gly Leu 595 600 605 Glu Ile Val Arg Arg Asp Trp Ser Glu Ile Ala Lys Glu Thr Gln Ala 610 615 620 Arg Val Leu Glu Ala Leu Leu Lys Asp Gly Asp Val Glu Lys Ala Val 625 630 635 640 Arg Ile Val Lys Glu Val Thr Glu Lys Leu Ser Lys Tyr Glu Val Pro 645 650 655 Pro Glu Lys Leu Val Ile His Glu Gln Ile Thr Arg Asp Leu Lys Asp 660 665 670 Tyr Lys Ala Thr Gly Pro His Val Ala Val Ala Lys Arg Leu Ala Ala 675 680 685 Arg Gly Val Lys Ile Arg Pro Gly Thr Val Ile Ser Tyr Ile Val Leu 690 695 700 Lys Gly Ser Gly Arg Ile Gly Asp Arg Ala Ile Pro Phe Asp Glu Phe 705 710 715 720 Asp Pro Thr Lys His Lys Tyr Asp Ala Glu Tyr Tyr Ile Glu Asn Gln 725 730 735 Val Leu Pro Ala Val Glu Arg Ile Leu Arg Ala Phe Gly Tyr Arg Lys 740 745 750 Glu Asp Leu Arg Tyr Gln Lys Thr Arg Gln Val Gly Leu Ser Ala Trp 755 760 765 Leu Lys Pro Lys Gly Thr 770 2 24 DNA Hyperthermophilic archaeon 2 cgccagggtt ttcccagtca cgac 24 3 20 DNA Hyperthermophilic archaeon 3 gggcggcgac ctcgcgggtt 20 4 24 DNA Hyperthermophilic archaeon 4 gcccataata atctgccggt caat 24 5 5342 DNA Hyperthermophilic archaeon 5 gcttgagggc ctgcggttat 
gggacgttgc agtttgcgcc tactcaaaga tgccggtttt 60 ataacggaga aaaatgggga gctattacga tctctccttg atgtggggtt tacaataaag 120 cctggattgt tctacaagat tatgggggat gaaag atg atc ctc gac act gac 173 Met Ile Leu Asp Thr Asp 5 tac ata acc gag gat gga aag cct gtc ata aga att ttc aag aag gaa 221 Tyr Ile Thr Glu Asp Gly Lys Pro Val Ile Arg Ile Phe Lys Lys Glu 10 15 20 aac ggc gag ttt aag att gag tac gac cgg act ttt gaa ccc tac ttc 269 Asn Gly Glu Phe Lys Ile Glu Tyr Asp Arg Thr Phe Glu Pro Tyr Phe 25 30 35 tac gcc ctc ctg aag gac gat tct gcc att gag gaa gtc aag aag ata 317 Tyr Ala Leu Leu Lys Asp Asp Ser Ala Ile Glu Glu Val Lys Lys Ile 40 45 50 acc gcc gag agg cac ggg acg gtt gta acg gtt aag cgg gtt gaa aag 365 Thr Ala Glu Arg His Gly Thr Val Val Thr Val Lys Arg Val Glu Lys 55 60 65 70 gtt cag aag aag ttc ctc ggg aga cca gtt gag gtc tgg aaa ctc tac 413 Val Gln Lys Lys Phe Leu Gly Arg Pro Val Glu Val Trp Lys Leu Tyr 75 80 85 ttt act cat ccg cag gac gtc cca gcg ata agg gac aag ata cga gag 461 Phe Thr His Pro Gln Asp Val Pro Ala Ile Arg Asp Lys Ile Arg Glu 90 95 100 cat gga gca gtt att gac atc tac gag tac gac ata ccc ttc gcc aag 509 His Gly Ala Val Ile Asp Ile Tyr Glu Tyr Asp Ile Pro Phe Ala Lys 105 110 115 cgc tac ctc ata gac aag gga tta gtg cca atg gaa ggc gac gag gag 557 Arg Tyr Leu Ile Asp Lys Gly Leu Val Pro Met Glu Gly Asp Glu Glu 120 125 130 ctg aaa atg ctc gcc ttc gac att caa act ctc tac cat gag ggc gag 605 Leu Lys Met Leu Ala Phe Asp Ile Gln Thr Leu Tyr His Glu Gly Glu 135 140 145 150 gag ttc gcc gag ggg cca atc ctt atg ata agc tac gcc gac gag gaa 653 Glu Phe Ala Glu Gly Pro Ile Leu Met Ile Ser Tyr Ala Asp Glu Glu 155 160 165 ggg gcc agg gtg ata act tgg aag aac gtg gat ctc ccc tac gtt gac 701 Gly Ala Arg Val Ile Thr Trp Lys Asn Val Asp Leu Pro Tyr Val Asp 170 175 180 gtc gtc tcg acg gag agg gag atg ata aag cgc ttc ctc cgt gtt gtg 749 Val Val Ser Thr Glu Arg Glu Met Ile Lys Arg Phe Leu Arg Val Val 185 190 195 aag gag aaa gac ccg gac gtt ctc ata acc tac aac ggc gac aac ttc 797 Lys 
Glu Lys Asp Pro Asp Val Leu Ile Thr Tyr Asn Gly Asp Asn Phe 200 205 210 gac ttc gcc tat ctg aaa aag cgc tgt gaa aag ctc gga ata aac ttc 845 Asp Phe Ala Tyr Leu Lys Lys Arg Cys Glu Lys Leu Gly Ile Asn Phe 215 220 225 230 gcc ctc gga agg gat gga agc gag ccg aag att cag agg atg ggc gac 893 Ala Leu Gly Arg Asp Gly Ser Glu Pro Lys Ile Gln Arg Met Gly Asp 235 240 245 agg ttt gcc gtc gaa gtg aag gga cgg ata cac ttc gat ctc tat cct 941 Arg Phe Ala Val Glu Val Lys Gly Arg Ile His Phe Asp Leu Tyr Pro 250 255 260 gtg ata aga cgg acg ata aac ctg ccc aca tac acg ctt gag gcc gtt 989 Val Ile Arg Arg Thr Ile Asn Leu Pro Thr Tyr Thr Leu Glu Ala Val 265 270 275 tat gaa gcc gtc ttc ggt cag ccg aag gag aag gtt tac gct gag gaa 1037 Tyr Glu Ala Val Phe Gly Gln Pro Lys Glu Lys Val Tyr Ala Glu Glu 280 285 290 ata aca cca gcc tgg gaa acc ggc gag aac ctt gag aga gtc gcc cgc 1085 Ile Thr Pro Ala Trp Glu Thr Gly Glu Asn Leu Glu Arg Val Ala Arg 295 300 305 310 tac tcg atg gaa gat gcg aag gtc aca tac gag ctt ggg aag gag ttc 1133 Tyr Ser Met Glu Asp Ala Lys Val Thr Tyr Glu Leu Gly Lys Glu Phe 315 320 325 ctt ccg atg gag gcc cag ctt tct cgc tta atc ggc cag tcc ctc tgg 1181 Leu Pro Met Glu Ala Gln Leu Ser Arg Leu Ile Gly Gln Ser Leu Trp 330 335 340 gac gtc tcc cgc tcc agc act ggc aac ctc gtt gag tgg ttc ctc ctc 1229 Asp Val Ser Arg Ser Ser Thr Gly Asn Leu Val Glu Trp Phe Leu Leu 345 350 355 agg aag gcc tat gag agg aat gag ctg gcc ccg aac aag ccc gat gaa 1277 Arg Lys Ala Tyr Glu Arg Asn Glu Leu Ala Pro Asn Lys Pro Asp Glu 360 365 370 aag gag ctg gcc aga aga cgg cag agc tat gaa gga ggc tat gta aaa 1325 Lys Glu Leu Ala Arg Arg Arg Gln Ser Tyr Glu Gly Gly Tyr Val Lys 375 380 385 390 gag ccc gag aga ggg ttg tgg gag acc ata gtg tac cta gat ttt aga 1373 Glu Pro Glu Arg Gly Leu Trp Glu Asn Ile Val Tyr Leu Asp Phe Arg 395 400 405 tgc cat cca gcc gat acg aag gtt gtc gtc aag ggg aag ggg att ata 1421 Cys His Pro Ala Asp Thr Lys Val Val Val Lys Gly Lys Gly Ile Ile 410 415 420 aac atc agc gag gtt cag gaa ggt 
gac tat gtc ctt ggg att gac ggc 1469 Asn Ile Ser Glu Val Gln Glu Gly Asp Tyr Val Leu Gly Ile Asp Gly 425 430 435 tgg cag aga gtt aga aaa gta tgg gaa tac gac tac aaa ggg gag ctt 1517 Trp Gln Arg Val Arg Lys Val Trp Glu Tyr Asp Tyr Lys Gly Glu Leu 440 445 450 gta aac ata aac ggg tta aag tgt acg ccc aat cat aag ctt ccc gtt 1565 Val Asn Ile Asn Gly Leu Lys Cys Thr Pro Asn His Lys Leu Pro Val 455 460 465 470 gtt aca aag aac gaa cga caa acg aga ata aga gac agt ctt gct aag 1613 Val Thr Lys Asn Glu Arg Gln Thr Arg Ile Arg Asp Ser Leu Ala Lys 475 480 485 tct ttc ctt act aaa aaa gtt aag ggc aag ata ata acc act ccc ctt 1661 Ser Phe Leu Thr Lys Lys Val Lys Gly Lys Ile Ile Thr Thr Pro Leu 490 495 500 ttc tat gaa ata ggc aga gcg aca agt gag aat att cca gaa gaa gag 1709 Phe Tyr Glu Ile Gly Arg Ala Thr Ser Glu Asn Ile Pro Glu Glu Glu 505 510 515 gtt ctc aag gga gag ctc gct ggc ata cta ttg gct gaa gga acg ctc 1757 Val Leu Lys Gly Glu Leu Ala Gly Ile Leu Leu Ala Glu Gly Thr Leu 520 525 530 ttg agg aaa gac gtt gaa tac ttt gat tca tcc cgc aaa aaa cgg agg 1805 Leu Arg Lys Asp Val Glu Tyr Phe Asp Ser Ser Arg Lys Lys Arg Arg 535 540 545 550 att tca cac cag tat cgt gtt gag ata acc att ggg aaa gac gag gag 1853 Ile Ser His Gln Tyr Arg Val Glu Ile Thr Ile Gly Lys Asp Glu Glu 555 560 565 gag ttt agg gat cgt atc aca tac att ttt gag cgt ttg ttt ggg att 1901 Glu Phe Arg Asp Arg Ile Thr Tyr Ile Phe Glu Arg Leu Phe Gly Ile 570 575 580 act cca agc atc tcg gag aag aaa gga act aac gca gta aca ctc aaa 1949 Thr Pro Ser Ile Ser Glu Lys Lys Gly Thr Asn Ala Val Thr Leu Lys 585 590 595 gtt gcg aag aag aat gtt tat ctt aaa gtc aag gaa att atg gac aac 1997 Val Ala Lys Lys Asn Val Tyr Leu Lys Val Lys Glu Ile Met Asp Asn 600 605 610 ata gag tcc cta cat gcc ccc tcg gtt ctc agg gga ttc ttc gaa ggc 2045 Ile Glu Ser Leu His Ala Pro Ser Val Leu Arg Gly Phe Phe Glu Gly 615 620 625 630 gac ggt tca gta aac agg gtt agg agg agt att gtt gca acc cag ggt 2093 Asp Gly Ser Val Asn Arg Val Arg Arg Ser Ile Val Ala Thr Gln Gly 
635 640 645 aca aag aac gag tgg aag att aaa ctg gtg tca aaa ctg ctc tcc cag 2141 Thr Lys Asn Glu Trp Lys Ile Lys Leu Val Ser Lys Leu Leu Ser Gln 650 655 660 ctt ggt atc cct cat caa acg tac acg tat cag tat cag gaa aat ggg 2189 Leu Gly Ile Pro His Gln Thr Tyr Thr Tyr Gln Tyr Gln Glu Asn Gly 665 670 675 aaa gat cgg agc agg tat ata ctg gag ata act gga aag gac gga ttg 2237 Lys Asp Arg Ser Arg Tyr Ile Leu Glu Ile Thr Gly Lys Asp Gly Leu 680 685 690 ata ctg ttc caa aca ctc att gga ttc atc agt gaa aga aag aac gct 2285 Ile Leu Phe Gln Thr Leu Ile Gly Phe Ile Ser Glu Arg Lys Asn Ala 695 700 705 710 ctg ctt aat aag gca ata tct cag agg gaa atg aac aac ttg gaa aac 2333 Leu Leu Asn Lys Ala Ile Ser Gln Arg Glu Met Asn Asn Leu Glu Asn 715 720 725 aat gga ttt tac agg ctc agt gaa ttc aat gtc agc acg gaa tac tat 2381 Asn Gly Phe Tyr Arg Leu Ser Glu Phe Asn Val Ser Thr Glu Tyr Tyr 730 735 740 gag ggc aag gtc tat gac tta act ctt gaa gga act ccc tac tac ttt 2429 Glu Gly Lys Val Tyr Asp Leu Thr Leu Glu Gly Thr Pro Tyr Tyr Phe 745 750 755 gcc aat ggc ata ttg acc cat aac tcc ctg tac ccc tca atc atc atc 2477 Ala Asn Gly Ile Leu Thr His Asn Ser Leu Tyr Pro Ser Ile Ile Ile 760 765 770 acc cac aac gtc tcg ccg gat acg ctc aac aga gaa gga tgc aag gaa 2525 Thr His Asn Val Ser Pro Asp Thr Leu Asn Arg Glu Gly Cys Lys Glu 775 780 785 790 tat gac gtt gcc cca cag gtc ggc cac cgc ttc tgc aag gac ttc cca 2573 Tyr Asp Val Ala Pro Gln Val Gly His Arg Phe Cys Lys Asp Phe Pro 795 800 805 gga ttt atc ccg agc ctg ctt gga gac ctc cta gag gag agg cag aag 2621 Gly Phe Ile Pro Ser Leu Leu Gly Asp Leu Leu Glu Glu Arg Gln Lys 810 815 820 ata aag aag aag atg aag gcc acg att gac ccg atc gag agg aag ctc 2669 Ile Lys Lys Lys Met Lys Ala Thr Ile Asp Pro Ile Glu Arg Lys Leu 825 830 835 ctc gat tac agg cag agg gcc atc aag atc ctg gca aac agc atc cta 2717 Leu Asp Tyr Arg Gln Arg Ala Ile Lys Ile Leu Ala Asn Ser Ile Leu 840 845 850 ccc gag gaa tgg ctt cca gtc ctc gag gaa ggg gag gtt cac ttc gtc 2765 Pro Glu Glu Trp Leu Pro 
Val Leu Glu Glu Gly Glu Val His Phe Val 855 860 865 870 agg att gga gag ctc ata gac cgg atg atg gag gaa aat gct ggg aaa 2813 Arg Ile Gly Glu Leu Ile Asp Arg Met Met Glu Glu Asn Ala Gly Lys 875 880 885 gta aag aga gag ggc gag acg gaa gtg ctt gag gtc agt ggg ctt gaa 2861 Val Lys Arg Glu Gly Glu Thr Glu Val Leu Glu Val Ser Gly Leu Glu 890 895 900 gtc ccg tcc ttt aac agg aga act aac aag gcc gag ctc aag aga gta 2909 Val Pro Ser Phe Asn Arg Arg Thr Asn Lys Ala Glu Leu Lys Arg Val 905 910 915 aag gcc ctg att agg cac gat tat tct ggc aag gtc tac acc atc aga 2957 Lys Ala Leu Ile Arg His Asp Tyr Ser Gly Lys Val Tyr Thr Ile Arg 920 925 930 ctg aag tcg ggg agg aga ata aag ata acc tct ggc cac agc ctc ttc 3005 Leu Lys Ser Gly Arg Arg Ile Lys Ile Thr Ser Gly His Ser Leu Phe 935 940 945 950 tct gtg aga aac ggg gag ctc gtt gaa gtt acg ggc gat gaa cta aat 3053 Ser Val Arg Asn Gly Glu Leu Val Glu Val Thr Gly Asp Glu Leu Lys 955 960 965 cca ggt gac ctc gtt gca gtc ccg cgg aga ttg gag ctt cct gag aga 3101 Pro Gly Asp Leu Val Ala Val Pro Arg Arg Leu Glu Leu Pro Glu Arg 970 975 980 aac cac gtg ctg aac ctc gtt gaa ctg ctc ctt gga acg cca gaa gaa 3149 Asn His Val Leu Asn Leu Val Glu Leu Leu Leu Gly Thr Pro Glu Glu 985 990 995 gaa act ttg gac atc gtc atg acg atc cca gtc aag ggt aag aag aac 3197 Glu Thr Leu Asp Ile Val Met Thr Ile Pro Val Lys Gly Lys Lys Asn 1000 1005 1010 ttc ttt aaa ggg atg ctc agg act ttg cgc tgg att ttc gga gag gaa 3245 Phe Phe Lys Gly Met Leu Arg Thr Leu Arg Trp Ile Phe Gly Glu Glu 1015 1020 1025 1030 aag agg ccc aga acc gcg aga cgc tat ctc agg cac ctt gag gat ctg 3293 Lys Arg Pro Arg Thr Ala Arg Arg Tyr Leu Arg His Leu Glu Asp Leu 1035 1040 1045 ggc tat gtc cgg ctt aag aag atc ggc tac gaa gtc ctc gac tgg gac 3341 Gly Tyr Val Arg Leu Lys Lys Ile Gly Tyr Glu Val Leu Asp Trp Asp 1050 1055 1060 tca ctt aag aac tac aga agg ctc tac gag gcg ctt gtc gag aac gtc 3389 Ser Leu Lys Asn Tyr Arg Arg Leu Tyr Glu Ala Leu Val Glu Asn Val 1065 1070 1075 aga tac aac ggc aac aag agg gag 
tac ctc gtt gaa ttc aat tcc atc 3437 Arg Tyr Asn Gly Asn Lys Arg Glu Tyr Leu Val Glu Phe Asn Ser Ile 1080 1085 1090 cgg gat gca gtt ggc ata atg ccc cta aaa gag ctg aag gag tgg aag 3485 Arg Asp Ala Val Gly Ile Met Pro Leu Lys Glu Leu Lys Glu Trp Lys 1095 1100 1105 1110 atc ggc acg ctg aac ggc ttc aga atg aga aag ctc att gaa gtg gac 3533 Ile Gly Thr Leu Asn Gly Phe Arg Met Arg Lys Leu Ile Glu Val Asp 1115 1120 1125 gag tcg tta gca aag ctc ctc ggc tac tac gtg agc gag ggc tat gca 3581 Glu Ser Leu Ala Lys Leu Leu Gly Tyr Tyr Val Ser Glu Gly Tyr Ala 1130 1135 1140 aga aag cag agg aat ccc aaa aac ggc tgg agc tac agc gtg aag ctc 3629 Arg Lys Gln Arg Asn Pro Lys Asn Gly Trp Ser Tyr Ser Val Lys Leu 1145 1150 1155 tac aac gaa gac cct gaa gtg ctg gac gat atg gag aga ctc gcc agc 3677 Tyr Asn Glu Asp Pro Glu Val Leu Asp Asp Met Glu Arg Leu Ala Ser 1160 1165 1170 agg ttt ttc ggg aag gtg agg cgg ggc agg aac tac gtt gag ata ccg 3725 Arg Phe Phe Gly Lys Val Arg Arg Gly Arg Asn Tyr Val Glu Ile Pro 1175 1180 1185 1190 aag aag atc ggc tac ctg ctc ttt gag aac atg tgc ggt gtc cta gcg 3773 Lys Lys Ile Gly Tyr Leu Leu Phe Glu Asn Met Cys Gly Val Leu Ala 1195 1200 1205 gag aac aag agg att ccc gag ttc gtc ttc acg tcc ccg aaa ggg gtt 3821 Glu Asn Lys Arg Ile Pro Glu Phe Val Phe Thr Ser Pro Lys Gly Val 1210 1215 1220 cgg ctg gcc ttc ctt gag ggg tac tca tcg gcg atg gcg acg tcc acc 3869 Arg Leu Ala Phe Leu Glu Gly Tyr Ser Ser Ala Met Ala Thr Ser Thr 1225 1230 1235 gaa caa gag act cag gct ctc aac gaa aag cga gct tta gcg aac cag 3917 Glu Gln Glu Thr Gln Ala Leu Asn Glu Lys Arg Ala Leu Ala Asn Gln 1240 1245 1250 ctc gtc ctc ctc ttg aac tcg gtg ggg gtc tct gct gta aaa ctt ggg 3965 Leu Val Leu Leu Leu Asn Ser Val Gly Val Ser Ala Val Lys Leu Gly 1255 1260 1265 1270 cac gac agc ggc gtt tac agg gtc tat ata aac gag gag ctc ccg ttc 4013 His Asp Ser Gly Val Tyr Arg Val Tyr Ile Asn Glu Glu Leu Pro Phe 1275 1280 1285 gta aag ctg gac aag aaa aag aac gcc tac tac tca cac gtg atc ccc 4061 Val Lys Leu Asp Lys Lys 
Lys Asn Ala Tyr Tyr Ser His Val Ile Pro 1290 1295 1300 aag gaa gtc ctg agc gag gtc ttt ggg aag gtt ttc cag aaa aac gtc 4109 Lys Glu Val Leu Ser Glu Val Phe Gly Lys Val Phe Gln Lys Asn Val 1305 1310 1315 agt cct cag acc ttc agg aag atg gtc gag gac gga aga ctc gat ccc 4157 Ser Pro Gln Thr Phe Arg Lys Met Val Glu Asp Gly Arg Leu Asp Pro 1320 1325 1330 gaa aag gcc cag agg ctc tcc tgg ctc att gag ggg gac gta gtg ctc 4205 Glu Lys Ala Gln Arg Leu Ser Trp Leu Ile Glu Gly Asp Val Val Leu 1335 1340 1345 1350 gac cgc gtt gag tcc gtt gat gtg gaa gac tac gat ggt tat gtc tat 4253 Asp Arg Val Glu Ser Val Asp Val Glu Asp Tyr Asp Gly Tyr Val Tyr 1355 1360 1365 gac ctg agc gtc gag gac aac gag aac ttc ctc gtt ggc ttt ggg ttg 4301 Asp Leu Ser Val Glu Asp Asn Glu Asn Phe Leu Val Gly Phe Gly Leu 1370 1375 1380 gtc tat gct cac aac agc tac tac ggt tac tac ggc tat gca agg gcg 4349 Val Tyr Ala His Asn Ser Tyr Tyr Gly Tyr Tyr Gly Tyr Ala Arg Ala 1385 1390 1395 cgc tgg tac tgc aag gag tgt gca gag agc gta acg gcc tgg gga agg 4397 Arg Trp Tyr Cys Lys Glu Cys Ala Glu Ser Val Thr Ala Trp Gly Arg 1400 1405 1410 gag tac ata acg atg acc atc aag gag ata gag gaa aag tac ggc ttt 4445 Glu Tyr Ile Thr Met Thr Ile Lys Glu Ile Glu Glu Lys Tyr Gly Phe 1415 1420 1425 1430 aag gta atc tac agc gac acc gac gga ttt ttt gcc aca ata cct gga 4493 Lys Val Ile Tyr Ser Asp Thr Asp Gly Phe Phe Ala Thr Ile Pro Gly 1435 1440 1445 gcc gat gct gaa acc gtc aaa aag aag gct atg gag ttc ctc aac tat 4541 Ala Asp Ala Glu Thr Val Lys Lys Lys Ala Met Glu Phe Leu Asn Tyr 1450 1455 1460 atc aac gcc aaa ctt ccg ggc gcg ctt gag ctc gag tac gag ggc ttc 4589 Ile Asn Ala Lys Leu Pro Gly Ala Leu Glu Leu Glu Tyr Glu Gly Phe 1465 1470 1475 tac aaa cgc ggc ttc ttc gtc acg aag aag aag tat gcg gtg ata gac 4637 Tyr Lys Arg Gly Phe Phe Val Thr Lys Lys Lys Tyr Ala Val Ile Asp 1480 1485 1490 gag gaa ggc aag ata aca acg cgc gga ctt gag att gtg agg cgt gac 4685 Glu Glu Gly Lys Ile Thr Thr Arg Gly Leu Glu Ile Val Arg Arg Asp 1495 1500 1505 1510 tgg 
agc gag ata gcg aaa gag acg cag gcg agg gtt ctt gaa gct ttg 4733 Trp Ser Glu Ile Ala Lys Glu Thr Gln Ala Arg Val Leu Glu Ala Leu 1515 1520 1525 cta aag gac ggt gac gtc gag aag gcc gtg agg ata gtc aaa gaa gtt 4781 Leu Lys Asp Gly Asp Val Glu Lys Ala Val Arg Ile Val Lys Glu Val 1530 1535 1540 acc gaa aag ctg agc aag tac gag gtt ccg ccg gag aag ctg gtg atc 4829 Thr Glu Lys Leu Ser Lys Tyr Glu Val Pro Pro Glu Lys Leu Val Ile 1545 1550 1555 cac gag cag ata acg agg gat tta aag gac tac aag gca acc ggt ccc 4877 His Glu Gln Ile Thr Arg Asp Leu Lys Asp Tyr Lys Ala Thr Gly Pro 1560 1565 1570 cac gtt gcc gtt gcc aag agg ttg gcc gcg aga gga gtc aaa ata cgc 4925 His Val Ala Val Ala Lys Arg Leu Ala Ala Arg Gly Val Lys Ile Arg 1575 1580 1585 1590 cct gga acg gtg ata agc tac atc gtg ctc aag ggc tct ggg agg ata 4973 Pro Gly Thr Val Ile Ser Tyr Ile Val Leu Lys Gly Ser Gly Arg Ile 1595 1600 1605 ggc gac agg gcg ata ccg ttc gac gag ttc gac ccg acg aag cac aag 5021 Gly Asp Arg Ala Ile Pro Phe Asp Glu Phe Asp Pro Thr Lys His Lys 1610 1615 1620 tac gac gcc gag tac tac att gag aac cag gtt ctc cca gcc gtt gag 5069 Tyr Asp Ala Glu Tyr Tyr Ile Glu Asn Gln Val Leu Pro Ala Val Glu 1625 1630 1635 aga att ctg aga gcc ttc ggt tac cgc aag gaa gac ctg cgc tac cag 5117 Arg Ile Leu Arg Ala Phe Gly Tyr Arg Lys Glu Asp Leu Arg Tyr Gln 1640 1645 1650 aag acg aga cag gtt ggt ttg agt gct tgg ctg aag ccg aag gga act 5165 Lys Thr Arg Gln Val Gly Leu Ser Ala Trp Leu Lys Pro Lys Gly Thr 1655 1660 1665 1670 tgacctttcc atttgttttc cagcggataa ccctttaact tccctttcaa aaactccctt 5225 tagggaaaga ccatgaagat agaaatccgg cggcgcccgg ttaaatacgc taggatagaa 5285 gtgaagccag acggcagggt agtcgtcact gccccgaggg ttcaacgttg agaagtt 5342 6 5339 DNA Hyperthermophilic archaeon 6 gcttgagggc ctgcggttat gggacgttgc agtttgcgcc tactcaaaga tgccggtttt 60 ataacggaga aaaatgggga gctattacga tctctccttg atgtggggtt tacaataaag 120 cctggattgt tctacaagat tatgggggat gaaagatgat cctcgacact gactacataa 180 ccgaggatgg aaagcctgtc ataagaattt tcaagaagga aaacggcgag 
tttaagattg 240 agtacgaccg gacttttgaa ccctacttct acgccctcct gaaggacgat tctgccattg 300 aggaagtcaa gaagataacc gccgagaggc acgggacggt tgtaacggtt aagcgggttg 360 aaaaggttca gaagaagttc ctcgggagac cagttgaggt ctggaaactc tactttactc 420 atccgcagga cgtcccagcg ataagggaca agatacgaga gcatggagca gttattgaca 480 tctacgagta cgacataccc ttcgccaagc gctacctcat agacaaggga ttagtgccaa 540 tggaaggcga cgaggagctg aaaatgctcg ccttcgacat tcaaactctc taccatgagg 600 gcgaggagtt cgccgagggg ccaatcctta tgataagcta cgccgacgag gaaggggcca 660 gggtgataac ttggaagaac gtggatctcc cctacgttga cgtcgtctcg acggagaggg 720 agatgataaa gcgcttcctc cgtgttgtga aggagaaaga cccggacgtt ctcataacct 780 acaacggcga caacttcgac ttcgcctatc tgaaaaagcg ctgtgaaaag ctcggaataa 840 acttcgccct cggaagggat ggaagcgagc cgaagattca gaggatgggc gacaggtttg 900 ccgtcgaagt gaagggacgg atacacttcg atctctatcc tgtgataaga cggacgataa 960 acctgcccac atacacgctt gaggccgttt atgaagccgt cttcggtcag ccgaaggaga 1020 aggtttacgc tgaggaaata acaccagcct gggaaaccgg cgagaacctt gagagagtcg 1080 cccgctactc gatggaagat gcgaaggtca catacgagct tgggaaggag ttccttccga 1140 tggaggccca gctttctcgc ttaatcggcc agtccctctg ggacgtctcc cgctccagca 1200 ctggcaacct cgttgagtgg ttcctcctca ggaaggccct atgagaggaa tgagctggcc 1260 ccgaacaagc ccgatgaaaa ggagctggcc agaagacggc agagctatga aggaggctat 1320 gtaaaagagc ccgagagagg gttgtgggag aacatagtgt acctagattt tagatgccat 1380 ccagccgata cgaaggttgt cgtcaagggg aaggggatta taaacatcag cgaggttcag 1440 gaaggtgact atgtccttgg gattgacggc tggcagagag ttagaaaagt atgggaatac 1500 gactacaaag gggagcttgt aaacataaac gggttaaagt gtacgcccaa tcataagctt 1560 cccgttgtta caaagaacga acgacaaacg agaataagag acagtcttgc taagtctttc 1620 cttactaaaa aagttaaggg caagataata accactcccc ttttctatga aataggcaga 1680 gcgacaagtg agaatattcc agaagaagag gttctcaagg gagagctcgc tggcatagta 1740 ttggctgaag gaacgctctt gaggaaagac gttgaatact ttgattcatc ccgcaaaaaa 1800 cggaggattt cacaccagta tcgtgttgag ataaccattg ggaaagacga ggaggagttt 1860 agggatcgta tcacatacat ttttgagcgt ttgtttggga ttactccaag catctcggag 1920 
aagaaaggaa ctaacgcagt aacactcaaa gttgcgaaga agaatgttta tcttaaagtc 1980 aaggaaatta tggacaacat agagtcccta catgccccct cggttctcag gggattcttc 2040 gaaggcgacg gttcagtaaa caggttagga ggagtattgt tgcaacccag ggtacaaaga 2100 acgagtggaa gattaaactg gtgtcaaaac tgctctccca gcttggtatc cctcatcaaa 2160 cgtacacgta tcagtatcag gaaaatggga aagatcggag caggtatata ctggagataa 2220 ctggaaagga cggattgata ctgttccaaa cactcattgg attcatcagt gaaagaaaga 2280 acgctctgct taataaggca atatctcaga gggaaatgaa caacttggaa aacaatggat 2340 tttacaggct cagtgaattc aatgtcagca cggaatacta tgagggcaag gtctatgact 2400 taactcttga aggaactccc tactttgcca atggcatatt gacccataac tccctgtacc 2460 cctcaatcat catcacccac aacgtctcgc cggatacgct caacagagaa ggatgcaagg 2520 aatatgacgt tgccccacag gtcggccacc gcttctgcaa ggacttccca ggatttatcc 2580 cgagcctgct tggagacctc ctagaggaga ggcagaagat aaagaagaag atgaaggcca 2640 cgattgaccc gatcgagagg aagctcctcg attacaggca gagggccatc aagatcctgg 2700 caaacagcat cctacccgag gaatggcttc cagtcctcga ggaaggggag gttcacttcg 2760 tcaggattgg agagctcata gaccggatga tggaggaaaa tgctgggaaa gtaaagagag 2820 agggcgagac ggaagtgctt gaggtcagtg ggcttgaagt cccgtccttt aacaggagaa 2880 ctaacaaggc cgagctcaag agagtaaagg ccctgattag gcacgattat tctggcaagg 2940 tctacaccat cagactgaag tcggggagga gaataaagat aacctctggc cacagcctct 3000 tctctgtgag aaacggggag ctcgttgaag ttacgggcga tgaactaaag ccaggtgacc 3060 tcgttgcagt cccgcggaga ttggagcttc ctgagagaaa ccacgtgctg aacctcgttg 3120 aactgctcct tggaacgcca gaagaagaaa ctttggacat cgtcatgacg atcccagtca 3180 agggtaagaa gaacttcttt aaagggatgc tcaggacttt gcgctggatt ttcggagagg 3240 aaaagaggcc cagaaccgcg agacgctatc tcaggcacct tgaggatctg ggctatgtcc 3300 ggcttaagaa gatcggctac gaagtcctcg actgggactc acttaagaac tacagaaggc 3360 tctacgaggc gcttgtcgag aacgtcagat acaacggcaa caagagggag tacctcgttg 3420 aattcaattc catccgggat gcagttggca taatgcccct aaaagagctg aaggagtgga 3480 agatcggcac gctgaacggc ttcagaatga gaaagctcat tgaagtggac gagtcgttag 3540 caaagctcct cggctactac gtgagcgagg gctatgcaag aaagcagagg aatcccaaaa 3600 acggctggag 
ctacagcgtg aagctctaca acgaagaccc tgaagtgctg gacgatatgg 3660 agagactcgc cagcaggttt ttcgggaagg tgaggcgggg caggaactac gttgagatac 3720 cgaagaagat cggctacctg ctctttgaga acatgtgcgg tgtcctagcg gagaacaaga 3780 ggattcccga tggcgtcttc acgtccccga aaggggttcg gctggccttc cttgaggggt 3840 actcatcggc gatggcgacg tccaccgaac aagagactca ggctctcaac gaaaagcgag 3900 ctttagcgaa ccagctcgtc ctcctcttga actcggtggg ggtctctgct gtaaaacttg 3960 ggcacgacag cggcgtttac agggtctata taaacgagga gctcccgttc gtaaagctgg 4020 acaagaaaaa gaacgcctac tactcacacg tgatccccaa ggaagtcctg agcgaggtct 4080 ttgggaaggt tttccagaaa aacgtcagtc ctcagacctt caggaagatg gtcgaggacg 4140 gaagactcga tcccgaaaag gcccagaggc tctcctggct cattgagggg gacgtagtgc 4200 tcgaccgcgt tgagtccgtt gatgtggaag actacgatgg ttatgtctat gacctgagcg 4260 tcgaggacaa cgagaacttc ctcgttggct ttgggttggt ctatgctcac aacagctact 4320 acggttacta cggctatgca agggcgcgct ggtactgcaa ggagtgtgca gagagcgtaa 4380 cggcctgggg aagggagtac ataacgatga ccatcaagga gatagaggaa aagtacggct 4440 ttaaggtaat ctacagcgac accgacggat tttttgccac aatacctgga gccgatgctg 4500 aaaccgtcaa aaagaaggct atggagttcc tcaactatat caacgccaaa cttccgggcg 4560 cgcttgagct cgagtacgag ggcttctaca aacgcggctt cttcgtcacg aagaagaagt 4620 atgcggtgat agacgaggaa ggcaagataa caacgcgcgg acttgagatt gtgaggcgtg 4680 actggagcga gatagcgaaa gagacgcagg cgagggttct tgaagctttg ctaaaggacg 4740 gtgacgtcga gaaggccgtg aggatagtca aagaagttac cgaaaagctg agcaagtacg 4800 aggttccgcc ggagaagctg gtgatccacg agcagataac gagggattta aaggactaca 4860 aggcaaccgg tccccacgtt gccgttgcca agaggttggc cgcgagagga gtcaaaatac 4920 gccctggaac ggtgataagc tacatcgtgc tcaagggctc tgggaggata ggcgacaggg 4980 cgataccgtt cgacgagttc gacccgacga agcacaagta cgatgccgag tactacattg 5040 agaaccaggt tctcccagcc gttgagagaa ttctgagagc cttcggttac cgcaaggaag 5100 acctgcgcta ccagaagacg agacaggttg gtttgagtgc ttggctgaag ccgaagggaa 5160 cttgaccttt ccatttgttt tccagcggat aaccctttaa cttccctttc aaaaactccc 5220 tttagggaaa gaccatgaag atagaaatcc ggcggcgccc ggttaaatac gctaggatag 5280 aagtgaagcc agacggcagg 
gtagtcgtca ctgccccgag ggttcaacgt tgagaagtt 5339
7 24 DNA Hyperthermophilic archaeon 7
ggattagtgc caatggaagg cgac 24
8 24 DNA Hyperthermophilic archaeon 8
gagggcgaag tttattccga gctt 24
9 324 DNA Hyperthermophilic archaeon 9
ggattagtgc caatggaagg cgacgaggag ctgaaaatgc tcgccttcga cattcaaact 60
ctctaccatg agggcgagga gttcgccgag gggccaatcc ttatgataag ctacgccgac 120
gaggaagggg ccagggtgat aacttggaag aacgtggatc tcccctacgt tgacgtcgtc 180
tcgacggaga gggagatgat aaagcgcttc ctccgtgttg tgaaggagaa agacccggac 240
gttctcataa cctacaacgg cgacaacttc gacttcgcct atctgaaaaa gcgctgtgaa 300
aagctcggaa taaacttcgc cctc 324
10 108 PRT Hyperthermophilic archaeon 10
Gly Leu Val Pro Met Glu Gly Asp Glu Glu Leu Lys Met Leu Ala Phe
                  5                  10                  15
Asp Ile Gln Thr Leu Tyr His Glu Gly Glu Glu Phe Ala Glu Gly Pro
             20                  25                  30
Ile Leu Met Ile Ser Tyr Ala Asp Glu Glu Gly Ala Arg Val Ile Thr
         35                  40                  45
Trp Lys Asn Val Asp Leu Pro Tyr Val Asp Val Val Ser Thr Glu Arg
     50                  55                  60
Glu Met Ile Lys Arg Phe Leu Arg Val Val Lys Glu Lys Asp Pro Asp
 65                  70                  75                  80
Val Leu Ile Thr Tyr Asn Gly Asp Asn Phe Asp Phe Ala Tyr Leu Lys
                 85                  90                  95
Lys Arg Cys Glu Lys Leu Gly Ile Asn Phe Ala Leu
            100                 105
11 42 DNA Hyperthermophilic archaeon 11
gccatcaaga tcctggcaaa cagctactac ggttactacg gc 42
12 32 DNA Hyperthermophilic archaeon 12
gatggatcca acttctcaac gttgaaccct cg 32
13 46 DNA Hyperthermophilic archaeon 13
gaacatagtg tacctagatt ttagatccct gtacccctca atcatc 46
14 42 DNA Hyperthermophilic archaeon 14
gccgtagtaa ccgtagtagc tgtttgccag gatcttgatg gc 42
15 33 DNA Hyperthermophilic archaeon 15
atcgatatcc tcgacactga ctacataacc gag 33
16 46 DNA Hyperthermophilic archaeon 16
gatgattgag gggtacaggg atctaaaatc taggtacact atgttc 46
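
The listing above gives SEQ ID Nos. 7 to 16 without describing how they relate to one another. The following Python sketch is an illustrative aid only, not part of the original listing. It checks relationships that can be read directly off the sequences: SEQ ID No. 14 appears to be the reverse complement of SEQ ID No. 11, SEQ ID No. 16 the reverse complement of SEQ ID No. 13, and SEQ ID Nos. 7 and 8 appear to match the two ends of the 324-base fragment of SEQ ID No. 9. The helper names `clean` and `reverse_complement` are hypothetical and introduced here for illustration; the sequences are copied from the listing.

```python
# Illustrative sketch: relationships among the short sequences in the listing.
# All sequences are copied from the listing; helper names are hypothetical.

_COMPLEMENT = str.maketrans("acgt", "tgca")


def clean(seq: str) -> str:
    """Remove the 10-base grouping spaces and normalise to lower case."""
    return "".join(seq.split()).lower()


def reverse_complement(seq: str) -> str:
    """Return the reverse complement of a DNA sequence (a, c, g, t only)."""
    return clean(seq).translate(_COMPLEMENT)[::-1]


SEQ_7 = clean("ggattagtgc caatggaagg cgac")
SEQ_8 = clean("gagggcgaag tttattccga gctt")
SEQ_9 = clean(
    "ggattagtgc caatggaagg cgacgaggag ctgaaaatgc tcgccttcga cattcaaact "
    "ctctaccatg agggcgagga gttcgccgag gggccaatcc ttatgataag ctacgccgac "
    "gaggaagggg ccagggtgat aacttggaag aacgtggatc tcccctacgt tgacgtcgtc "
    "tcgacggaga gggagatgat aaagcgcttc ctccgtgttg tgaaggagaa agacccggac "
    "gttctcataa cctacaacgg cgacaacttc gacttcgcct atctgaaaaa gcgctgtgaa "
    "aagctcggaa taaacttcgc cctc"
)
SEQ_11 = clean("gccatcaaga tcctggcaaa cagctactac ggttactacg gc")
SEQ_13 = clean("gaacatagtg tacctagatt ttagatccct gtacccctca atcatc")
SEQ_14 = clean("gccgtagtaa ccgtagtagc tgtttgccag gatcttgatg gc")
SEQ_16 = clean("gatgattgag gggtacaggg atctaaaatc taggtacact atgttc")

checks = {
    "SEQ 14 is reverse complement of SEQ 11": reverse_complement(SEQ_11) == SEQ_14,
    "SEQ 16 is reverse complement of SEQ 13": reverse_complement(SEQ_13) == SEQ_16,
    "SEQ 9 starts with SEQ 7": SEQ_9.startswith(SEQ_7),
    "SEQ 9 ends with reverse complement of SEQ 8": SEQ_9.endswith(
        reverse_complement(SEQ_8)
    ),
}

for description, result in checks.items():
    print(f"{description}: {result}")
```

A plain string translation table is used rather than a sequence-analysis library so that the sketch stays self-contained; it assumes the sequences contain only the four unambiguous bases, as they do in the listing.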

What we claim is:
 1. A method for purifying DNA polymerase obtainable from a KOD1 strain which belongs to a Hyperthermophilic archaeon, comprising culturing recombinant host cells transformed by a recombinant DNA expression vector that comprises a foreign DNA sequence inserted into a vector, wherein the foreign DNA sequence encodes the thermostable DNA polymerase derived from a KOD1 strain which belongs to Hyperthermophilic archaeon, having an amino acid sequence of SEQ ID No. 1 and/or a nucleotide sequence of SEQ ID No. 5 and further (a) recovering the cultured recombinant host cells, lysing them and preparing the cell extract, and (b) removing the impurified proteins derived from recombinant host cells.
 2. An isolated DNA encoding a thermostable DNA polymerase which is a strain KOD1 Hyperthermophilic archaeon, wherein the polymerase has an amino acid sequence of SEQ ID No. 1.
 3. An isolated DNA encoding a thermostable DNA polymerase which is a strain KOD1 Hyperthermophilic archaeon, wherein the isolated DNA has a nucleotide sequence of SEQ ID No. 5.